INTERNATIONAL SEARCH REPORT

International application No.

PCT/US 99/29824 



Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet)

This International Search Report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons:

1. □ Claims Nos.:

because they relate to subject matter not required to be searched by this Authority, namely:



2. □ Claims Nos.:

because they relate to parts of the International Application that do not comply with the prescribed requirements to such
an extent that no meaningful International Search can be carried out, specifically:



3. □ Claims Nos.:

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a).

Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 

This International Searching Authority found multiple inventions in this international application, as follows:



1. □ As all required additional search fees were timely paid by the applicant, this International Search Report covers all
searchable claims.

2. □ As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment
of any additional fee.

3. □ As only some of the required additional search fees were timely paid by the applicant, this International Search Report
covers only those claims for which fees were paid, specifically claims Nos.:

4. ☒ No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is
restricted to the invention first mentioned in the claims; it is covered by claims Nos.:

1-23 ALL PARTLY

Remark on Protest

□ The additional search fees were accompanied by the applicant's protest.
□ No protest accompanied the payment of additional search fees.



Form PCT/ISA/210 (continuation of first sheet (1)) (July 1998) 



International Application No. PCT/US 99/29824 



FURTHER INFORMATION CONTINUED FROM PCT/ISA/210



1. Claims: 1-23 all partly

Isolated polynucleotide and polypeptide sequences encoding
a plant protein (SNF1 protein kinase) involved in carbon
catabolite repression and their selection and uses where the
sequences are seq id nos 1, 2, 23, 24.

2. Claims: 1-21 all partly

Isolated polynucleotide and polypeptide sequences encoding
a plant protein (SNF1 protein kinase) involved in carbon
catabolite repression and their selection and uses where the
sequences are seq id nos 3, 4.

3. Claims: 1-21 all partly

Isolated polynucleotide and polypeptide sequences encoding
a plant protein (SNF1 protein kinase) involved in carbon
catabolite repression and their selection and uses where the
sequences are seq id nos 5, 6.

4. Claims: 1-21 all partly

Isolated polynucleotide and polypeptide sequences encoding
a plant protein (SNF1 protein kinase) involved in carbon
catabolite repression and their selection and uses where the
sequences are seq id nos 7, 8.

5. Claims: 1-21 all partly

Isolated polynucleotide and polypeptide sequences encoding
a plant protein (SNF1 protein kinase) involved in carbon
catabolite repression and their selection and uses where the
sequences are seq id nos 9, 10.

6. Claims: 1-21 all partly

Isolated polynucleotide and polypeptide sequences encoding
a plant protein (SNF1 protein kinase) involved in carbon
catabolite repression and their selection and uses where the
sequences are seq id nos 11, 12.

7. Claims: 1-21 all partly

Isolated polynucleotide and polypeptide sequences encoding
a plant protein (SNF1 protein kinase) involved in carbon
catabolite repression and their selection and uses where the
sequences are seq id nos 13, 14.

8. Claims: 1-21 all partly

Isolated polynucleotide and polypeptide sequences encoding
a plant protein (SNF1 protein kinase) involved in carbon
catabolite repression and their selection and uses where the
sequences are seq id nos 15, 16.

9. Claims: 1-23 all partly

Isolated polynucleotide and polypeptide sequences encoding
a plant protein (SNF1 protein kinase) involved in carbon
catabolite repression and their selection and uses where the
sequences are seq id nos 17, 18, 25, 26.

10. Claims: 1-21 all partly

Isolated polynucleotide and polypeptide sequences encoding
a plant protein (SNF1 protein kinase) involved in carbon
catabolite repression and their selection and uses where the
sequences are seq id nos 19, 20.

11. Claims: 1-23 all partly

Isolated polynucleotide and polypeptide sequences encoding
a plant protein (SNF1 protein kinase) involved in carbon
catabolite repression and their selection and uses where the
sequences are seq id nos 21, 22, 27, 28.



INTERNATIONAL SEARCH REPORT

International Application No

PCT/US 99/29824 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC 7 C12N15/64 C12N9/12 C12Q1/48 C07K14/415 C12N5/10 C12Q1/68



According to International Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 

IPC 7 C12N 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched



Electronic data base consulted during the international search (name of data base and, where practical, search terms used)



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category

Citation of document, with indication, where appropriate, of the relevant passages

SANO ET AL.: "Light and nutritional
regulation of transcripts encoding a wheat
protein kinase homolog is mediated by
cytokinins"
PROCEEDINGS OF THE NATIONAL ACADEMY OF
SCIENCE USA,
vol. 91, March 1994 (1994-03), pages
2582-2586, XP002137533
cited in the application
abstract; figure 1

Relevant to claim No.

1,2,10,22,23

-/--

Further documents are listed in the continuation of box C.



Patent family members are listed in annex. 



* Special categories of cited documents:

"A" document defining the general state of the art which is not
considered to be of particular relevance

"E" earlier document but published on or after the international
filing date

"L" document which may throw doubts on priority claim(s) or
which is cited to establish the publication date of another
citation or other special reason (as specified)

"O" document referring to an oral disclosure, use, exhibition or
other means

"P" document published prior to the international filing date but
later than the priority date claimed



Date of the actual completion of the international search 



15 May 2000 

Name and mailing address of the ISA 

European Patent Office, P.B. 5818 Patentlaan 2
NL - 2280 HV Rijswijk
Tel. (+31-70) 340-2040, Tx. 31 651 epo nl,
Fax: (+31-70) 340-3016

Form PCT/ISA/210 (second sheet) (July 1992)



"T" later document published after the international filing date
or priority date and not in conflict with the application but
cited to understand the principle or theory underlying the
invention

"X" document of particular relevance; the claimed invention
cannot be considered novel or cannot be considered to
involve an inventive step when the document is taken alone

"Y" document of particular relevance; the claimed invention
cannot be considered to involve an inventive step when the
document is combined with one or more other such documents,
such combination being obvious to a person skilled
in the art.

"&" document member of the same patent family



Date of mailing of the international search report



^ 6. 8 00 



Authorized officer 



CEDER 0. 
International Application No

PCT/US 99/29824 



C.(Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT

Category / Citation of document, with indication, where appropriate, of the relevant passages



Relevant to claim No. 



HOTTA H ET AL: "Molecular analysis of a
novel protein kinase in maturing rice
seed"
GENE,
vol. 213, no. 1-2,
15 June 1998 (1998-06-15), pages 47-54,
XP004125002
ISSN: 0378-1119
abstract

TSAI ET AL.: "SNF1-related protein
kinase"
EMBL SEQUENCE DATABASE,
1 November 1996 (1996-11-01), XP002137634
HEIDELBERG DE
Ac Q40740
the whole document

TSAI ET AL.: "Oryza sativa SNF1-related
protein kinase (RSkl) mRNA, complete cds"
EMBL SEQUENCE DATABASE,
3 August 1996 (1996-08-03), XP002137635
HEIDELBERG DE
Ac U55768
the whole document

ALDERSON ET AL.: "Complementation of
SNF1, a mutation affecting global
regulation of carbon metabolism in yeast,
by a plant protein kinase cDNA"
PROCEEDINGS OF THE NATIONAL ACADEMY OF
SCIENCE USA,
vol. 88, October 1991 (1991-10), pages
8602-8605, XP002137636
cited in the application
abstract; figure 1

ANNEN ET AL.: "Characterization of 14
different putative protein kinase cDNA
clones of the C4 plant Sorghum bicolor"
EMBL SEQUENCE DATABASE,
18 August 1998 (1998-08-18), XP002137637
HEIDELBERG DE
Ac AA738543
the whole document
& MOL. GEN. GENET.,
vol. 259, no. 1, 1998, pages 115-122,

-/--



5,14,15, 
18 



Form PCT/ISA/210 (continuation of second sheet) (July 1992)



Category

A

MAN ET AL.: "Potato SNF1-related protein
kinase: molecular cloning, expression,
analysis and peptide kinase activity
measurements"
EMBL SEQUENCE DATABASE,
1 November 1996 (1996-11-01), XP002137638
HEIDELBERG DE
Ac Q41485
the whole document
& PLANT MOL BIOL,
vol. 34, 1997, pages 31-43,

GUMPEL: "C. sativus mRNA for SNF1-related
protein kinase"
EMBL SEQUENCE DATABASE,
17 December 1996 (1996-12-17), XP002137639
HEIDELBERG DE
Ac Y10035
the whole document

Relevant to claim No.

1
PATENT COOPERATION TREATY

From the
INTERNATIONAL PRELIMINARY EXAMINING AUTHORITY






To: 






SCHA^FER, Andrew L. 
DU PONT PHARMACEUTICALS COMPANY 

Legal Patent Records Center
1007 Market Street
Wilmington, Delaware 19898 
ETATS-UNIS D'AMERIQUE 



RECEIVED
MAR 06 2001





NOTIFICATION OF TRANSMITTAL OF 
THE INTERNATIONAL PRELIMINARY 
EXAMINATION REPORT 

(PCT Rule 71.1) 





Date of mailing
(day/month/year) 23.02.2001


Applicant's or agent's file reference
BB1315


IMPORTANT NOTIFICATION


International application No.
PCT/US99/29824


International filing date (day/month/year)
15/12/1999


Priority date (day/month/year)
16/12/1998


Applicant 

E.I. DU PONT DE NEMOURS AND COMPANY et al. 



1. The applicant is hereby notified that this International Preliminary Examining Authority transmits herewith the
international preliminary examination report and its annexes, if any, established on the international application.

2. A copy of the report and its annexes, if any, is being transmitted to the International Bureau for communication
to all the elected Offices.

3. Where required by any of the elected Offices, the International Bureau will prepare an English translation of the
report (but not of any annexes) and will transmit such translation to those Offices.

4. REMINDER

The applicant must enter the national phase before each elected Office by performing certain acts (filing
translations and paying national fees) within 30 months from the priority date (or later in some Offices) (Article
39(1)) (see also the reminder sent by the International Bureau with Form PCT/IB/301).

Where a translation of the international application must be furnished to an elected Office, that translation must
contain a translation of any annexes to the international preliminary examination report. It is the applicant's
responsibility to prepare and furnish such translation directly to each elected Office concerned.

For further details on the applicable time limits and requirements of the elected Offices, see Volume II of the
PCT Applicant's Guide.



Name and mailing address of the IPEA/

European Patent Office
D-80298 Munich
Tel. +49 89 2399 - 0 Tx: 523656 epmu d
Fax: +49 89 2399 - 4465

Authorized officer

Hingel, W

Tel. +49 89 2399-8717

Form PCT/IPEA/416 (July 1992)
PATENT COOPERATION TREATY

PCT

INTERNATIONAL PRELIMINARY EXAMINATION REPORT 

(PCT Article 36 and Rule 70) 



Applicant's or agent's file reference
BB1315

International application No.
PCT/US99/29824

FOR FURTHER ACTION See Notification of Transmittal of International
Preliminary Examination Report (Form PCT/IPEA/416)

International filing date (day/month/year)
15/12/1999

Priority date (day/month/year)
16/12/1998

International Patent Classification (IPC) or national classification and IPC
C12N15/54

Applicant

E.I. DU PONT DE NEMOURS AND COMPANY et al.



1. This international preliminary examination report has been prepared by this International Preliminary Examining Authority 
and is transmitted to the applicant according to Article 36. 

2. This REPORT consists of a total of 9 sheets, including this cover sheet. 

□ This report is also accompanied by ANNEXES, i.e. sheets of the description, claims and/or drawings which have 
been amended and are the basis for this report and/or sheets containing rectifications made before this Authority 
(see Rule 70.16 and Section 607 of the Administrative Instructions under the PCT). 

These annexes consist of a total of sheets. 



3. This report contains indications relating to the following items:

I ☒ Basis of the report
II □ Priority
III ☒ Non-establishment of opinion with regard to novelty, inventive step and industrial applicability
IV □ Lack of unity of invention
V ☒ Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability;
citations and explanations supporting such statement
VI □ Certain documents cited
VII □ Certain defects in the international application
VIII ☒ Certain observations on the international application

Date of submission of the demand

10/07/2000

Name and mailing address of the international
preliminary examining authority:

European Patent Office
D-80298 Munich
Tel. +49 89 2399 - 0 Tx: 523656 epmu d
Fax: +49 89 2399 - 4465

Date of completion of this report

23.02.2001

Authorized officer
Paresce, D

Telephone No. +49 89 2399 8995

Form PCT/IPEA/409 (cover sheet) (January 1994)



INTERNATIONAL PRELIMINARY 
EXAMINATION REPORT 



International application No. PCT/US99/29824 



I. Basis of the report

1. This report has been drawn on the basis of (substitute sheets which have been furnished to the receiving Office in
response to an invitation under Article 14 are referred to in this report as "originally filed" and are not annexed to
the report since they do not contain amendments (Rules 70.16 and 70.17)):

Description, pages:

1-26 as originally filed

Claims, No.:

1-23 as originally filed

Sequence listing part of the description, pages:

1-28, as originally filed

2. With regard to the language, all the elements marked above were available or furnished to this Authority in the
language in which the international application was filed, unless otherwise indicated under this item.

These elements were available or furnished to this Authority in the following language: , which is:

□ the language of a translation furnished for the purposes of the international search (under Rule 23.1(b)).

□ the language of publication of the international application (under Rule 48.3(b)).

□ the language of a translation furnished for the purposes of international preliminary examination (under Rule
55.2 and/or 55.3).

3. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the
international preliminary examination was carried out on the basis of the sequence listing:

☒ contained in the international application in written form.

☒ filed together with the international application in computer readable form.

□ furnished subsequently to this Authority in written form.

□ furnished subsequently to this Authority in computer readable form.

□ The statement that the subsequently furnished written sequence listing does not go beyond the disclosure in
the international application as filed has been furnished.

□ The statement that the information recorded in computer readable form is identical to the written sequence
listing has been furnished.

4. The amendments have resulted in the cancellation of:

□ the description, pages:

□ the claims, Nos.:

□ the drawings, sheets:



Form PCT/IPEA/409 (Boxes I-VIII, Sheet 1) (July 1998)






5. □ This report has been established as if (some of) the amendments had not been made, since they have been
considered to go beyond the disclosure as filed (Rule 70.2(c)):

(Any replacement sheet containing such amendments must be referred to under item 1 and annexed to this
report.)



6. Additional observations, if necessary:



III. Non-establishment of opinion with regard to novelty, inventive step and industrial applicability

1. The questions whether the claimed invention appears to be novel, to involve an inventive step (to be non-obvious),
or to be industrially applicable have not been examined in respect of:

□ the entire international application.
☒ claims Nos. 1-23 partially.

because:

□ the said international application, or the said claims Nos. relate to the following subject matter which does
not require an international preliminary examination (specify):



□ the description, claims or drawings (indicate particular elements below) or said claims Nos. are so unclear
that no meaningful opinion could be formed (specify):



□ the claims, or said claims Nos. are so inadequately supported by the description that no meaningful opinion
could be formed.

☒ no international search report has been established for the said claims Nos. 1-23 partially.

2. A meaningful international preliminary examination report cannot be carried out due to the failure of the nucleotide
and/or amino acid sequence listing to comply with the standard provided for in Annex C of the Administrative
Instructions:

□ the written form has not been furnished or does not comply with the standard.

□ the computer readable form has not been furnished or does not comply with the standard.

V. Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 

1. Statement

Novelty (N)                     Yes: Claims 1-23 partially
                                No:  Claims

Inventive step (IS)             Yes: Claims
                                No:  Claims 1-23 partially

Industrial applicability (IA)   Yes: Claims 1-23 partially
                                No:  Claims


2. Citations and explanations 
see separate sheet 



VIII. Certain observations on the international application 

The following observations on the clarity of the claims, description, and drawings or on the question whether the 
claims are fully supported by the description, are made: 
see separate sheet 



Form PCT/IPEA/409 (Boxes I-VIII, Sheet 3) (July 1998)



INTERNATIONAL PRELIMINARY EXAMINATION REPORT - SEPARATE SHEET

International application No. PCT/US99/29824



Re Item I

Basis of the report 

This International Preliminary Examination Report will be based on the subject- 
matter of claims 1-23 partially (see below). 

Re Item III 

Non-establishment of opinion with regard to novelty, inventive step and 
industrial applicability 

The Examining Division agrees with the objection put forward by the Search 
Division as to lack of unity (Rule 13.1 PCT). As the Applicant has not had a search 
report drawn up on the other invention (Rule 68.5 PCT), the application will be 
prosecuted on the basis of the invention in respect of which a search has already 
been carried out, in other words, the invention first mentioned in the claims. This 
includes the subject-matter of claims 1-23 partially. These claims are directed to 
isolated polynucleotide and polypeptide sequences encoding a plant protein 
(SNF1 protein kinase) involved in carbon catabolite repression, wherein the 
sequences are SEQ ID NOs: 1, 2, 23 and 24. The claims are further directed to
the methods of selection and uses of said sequences (see International Search 
Report). 

The Applicant's attention is drawn to the fact that claims, or parts of claims, 
relating to inventions in respect of which no international search report has been 
established need not be the subject of an international preliminary examination 
(Rule 66.1(e) PCT). The Applicant is advised that the EPO policy when acting as
an International Preliminary Examining Authority is normally not to carry out a
preliminary examination on matter which has not been searched. This is the case
irrespective of whether or not the claims are amended following receipt of the
search report or during any Chapter II procedure. The applicant should therefore
limit the application to the invention searched and excise those parts of the
application relating to the other inventions.

Re Item V

Reasoned statement under Article 35(2) with regard to novelty, inventive step or
industrial applicability; citations and explanations supporting such statement

The documents mentioned in this communication are numbered as in the search 
report, i.e. D1 corresponds to the first document of the search report. 

The invention of the present application is the provision of nucleic acid sequences 
encoding all or portions of a corn sucrose non-fermenting protein (SNF1). At the 
priority date of the present application, SNF1 was known to play a central role in 
carbon catabolite repression in yeast. It was identified by screening for genes 
involved in the regulation of invertase gene expression and is required for the 
expression of several glucose-repressible genes in response to glucose 
deprivation (see D5, page 8604). In yeast, the SNF1 gene encodes a 72 kDa 
serine and threonine protein kinase whose activity is down regulated by the 
availability of glucose. In addition, several plant SNF1-like genes had been 
identified in the prior art. D1 discloses a wheat SNF1 protein kinase and its 
nucleotide and deduced amino acid sequence (D1, figure 1). The amino acid 
sequence of the protein kinase was compared to members of the SNF1 kinase 
subfamily. D2 and D3 disclose a rice SNF1-related protein kinase nucleotide and
amino acid sequence. The polypeptide sequence disclosed in D3 shows 87.1% 
identity in a 510 amino acid overlap to SEQ ID NO: 4 of the present application. 
D6, D7 and D8 disclose SNF1 protein kinase polynucleotide and polypeptide 
sequences from (respectively) sorghum, potato, and cucumber. D5 describes a 
rye SNF1 protein kinase polynucleotide and polypeptide sequence (see figure 1). 
The polypeptide sequence disclosed in D5 shows 77.9% identity in a 510 amino 
acid overlap to SEQ ID NO: 4 of the present application. Expression of the rye 
SNF1 cDNA in yeast snf1 mutants restored SNF1 function; therefore, it was
suggested that yeast and rye may have similar mechanisms for carbon catabolite 
repression (see abstract). 

The subject-matter of claims 1-23 has not been made available to the public by 
any of the available prior art documents and can therefore be regarded as novel. 

In view of the prior art, the problem to be solved by the present invention may be
regarded as the provision of a further sequence encoding a plant SNF1-like
protein. The solution, however, proposed in claims 1-23 of the present application
cannot be considered as involving an inventive step (Article 33(3) PCT) for the
following reasons.

The polypeptide encoded by SEQ ID NO: 1 or 23 is characterized in structural 
terms but not in functional terms. Claim 1 is directed to a polypeptide
characterized only by an amino acid sequence. Claim 2 is directed to a DNA 
molecule characterized only by a nucleic acid sequence. The DNA or polypeptide 
is thus structurally characterized but the claims include no other technical effect or 
function for said DNA. 

According to page 2 of the description of the present application, SEQ ID NOs 1,
2, 23 and 24 are nucleic acid or amino acid sequences encoding a corn SNF1
protein. This was established by computer-assisted sequence alignments in which 
said sequences were compared with sequences of proteins of the SNF1 family. In 
fact, the sequences claimed in the present application were identified by standard 
procedures well-known to a person skilled in the art. cDNA libraries were prepared 
from various plant tissues and cDNA clones encoding SNF1-like proteins were 
identified by conducting BLAST. The cDNA sequences were compared to all 
publicly available DNA sequences and the nucleic acid fragments encoding 
probable SNF1 proteins were identified. From the sequence alignments with 
proteins of the SNF1 family it is concluded that, "these sequences represent the 
first corn, soybean and wheat sequences and a new rice sequence encoding
SNF1 proteins" (present application, p. 21). At this stage, however, the protein is 
still only structurally defined. The comparison studies do not allow any concrete 
conclusions to be drawn concerning the actual function of the corn "SNF1", but
only speculations that need to be confirmed experimentally. 

If a nucleic acid or protein is only defined as being related or as having homology
to another one, it has no definite function. Even though the corn SNF1 is defined
in the present application as belonging to the SNF1 family, different members of 
any family can have very different functions, for example see pages 1-2 of the 
description. Consequently, such a definition cannot be considered to contain any 
technical information. Without any further indication of which of the many functions 
of the SNF family are intended for the claimed protein, the skilled person would be 
left guessing as to what the claimed protein should be used for. Examples 4-6 of
the present application merely describe methods of expressing the claimed cDNA
in various host cells. These examples provide no further information on the 
function or technical features of the claimed protein. 

If an invention should provide a solution to a problem with reference to the 
background art (Article 33(3) PCT), the invention of the present application is 
insufficiently disclosed and unclear (Article 6 PCT) since it is left to the reader to 
perform the invention and to determine what problem, if any, the isolated
nucleotide or polypeptide sequences actually solve. 

Furthermore, due to the fact that the claimed sequences are not associated with 
any known technical effect, the only problem which might be recognized is the 
mere provision of further DNA sequences as such, regardless of their possible
useful properties. In this case all known DNA sequences are equally suitable 
candidates for solving the above "technical problem" and would, therefore, all 
equally be suggested to the skilled person. The arbitrary selection from an infinite 
number of equally obvious possible solutions cannot involve an inventive step 
because, in order to be patentable, the selection must not be arbitrary but must be 
justified by the technical purpose, e.g. by a hitherto unknown or unexpected 
technical effect which is caused by those structural features distinguishing the 
claimed compounds from the numerous other ones. 

Furthermore, procedures for screening and cloning using EST sequences were 
routine at the priority date of the present application (16.12.98). Therefore, in 
general, the skilled person could be expected to perform the cloning of a gene 
with a reasonable expectation of success and without undue burden. 
Furthermore, procedures for performing sequence alignments and production of 
expression systems are all standard procedures well known to a person skilled in 
the art. Therefore, the IPEA is of the opinion that the polynucleotide or polypeptide 
sequences in question cannot be considered to fulfill the requirements of Article 6 
PCT with regard to clarity as well as the requirements of Article 33(3) PCT with 
respect to inventive step. 

Furthermore, in so far as the present specification is completely silent with regard 
to the function of the protein encoded by SEQ ID NOs: 1, 2, 23, 24, the possible
fields of application for the claimed protein and polynucleotides suggested in the
specification are only speculative. Thus it is questionable whether the claimed 
subject-matter actually meets the requirements set forth in Article 33(4) PCT. 

VIII. Certain observations on the international application 

1) Clarity: Article 6 PCT

Article 6 PCT requires amongst other things that the claims, which define the 
matter for which protection is sought (i.e. the object of invention) be clear. This 
has to be interpreted as meaning not only that a claim from a technical point of 
view must be comprehensible, but also that it must define clearly the object of the 
invention, that is to say, it must indicate all the essential features thereof. In the 
present case, claim 1 refers to a polypeptide comprising a given sequence. An 
amino acid sequence, however, does not provide sufficient information to clearly 
characterize the claimed polypeptide. The polypeptide in question should be 
additionally characterized by name, source and function. Claim 2 at present 
covers DNA sequences which encode a polypeptide without any biological
function. These claims should also include the function of said polypeptide. 
However, the attention of the applicant is directed to the fact that such a functional 
limitation must have a basis in the disclosure of the present application.

INTERNATIONAL PRELIMINARY EXAMINATION REPORT

I. Basis of the report

1. This report has been drawn on the basis of (substitute sheets which have been furnished to the receiving Office in
response to an invitation under Article 14 are referred to in this report as "originally filed" and are not annexed to
the report since they do not contain amendments (Rules 70.16 and 70.17)):
Description, pages: 

1-26, as originally filed



Sequence listing part of the description, pages: 

1-28, as originally filed 

2. With regard to the language, all the elements marked above were available or furnished to this Authority in the 
language in which the international application was filed, unless otherwise indicated under this item. 

These elements were available or furnished to this Authority in the following language: , which is: 

□ the language of a translation furnished for the purposes of the international search (under Rule 23.1 (b)). 

□ the language of publication of the international application (under Rule 48.3(b)). 

□ the language of a translation furnished for the purposes of international preliminary examination (under Rule 
55.2 and/or 55.3). 

3. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the
international preliminary examination was carried out on the basis of the sequence listing: 

☒ contained in the international application in written form.

☒ filed together with the international application in computer readable form.

□ furnished subsequently to this Authority in written form. 

□ furnished subsequently to this Authority in computer readable form. 

□ The statement that the subsequently furnished written sequence listing does not go beyond the disclosure in 
the international application as filed has been furnished. 

□ The statement that the information recorded in computer readable form is identical to the written sequence 
listing has been furnished. 



Claims, No.: 



1-23 



as originally filed 



4. The amendments have resulted in the cancellation of: 



□ the description, 

□ the claims, 

□ the drawings, 



pages: 



sheets: 



Nos.:

5. □ This report has been established as if (some of) the amendments had not been made, since they have been

considered to go beyond the disclosure as filed (Rule 70.2(c)): 

(Any replacement sheet containing such amendments must be referred to under item 1 and annexed to this 
report.) 

6. Additional observations, if necessary:

III. Non-establishment of opinion with regard to novelty, inventive step and industrial applicability 

1. The questions whether the claimed invention appears to be novel, to involve an inventive step (to be non-
obvious), or to be industrially applicable have not been examined in respect of:

□ the entire international application.
☒ claims Nos. 1-23 partially.

because: 

□ the said international application, or the said claims Nos. relate to the following subject matter which does
not require an international preliminary examination (specify):

□ the description, claims or drawings (indicate particular elements below) or said claims Nos. are so unclear
that no meaningful opinion could be formed (specify):

□ the claims, or said claims Nos. are so inadequately supported by the description that no meaningful opinion
could be formed.

☒ no international search report has been established for the said claims Nos. 1-23 partially.

2. A meaningful international preliminary examination report cannot be carried out due to the failure of the nucleotide 
and/or amino acid sequence listing to comply with the standard provided for in Annex C of the Administrative 
Instructions: 

□ the written form has not been furnished or does not comply with the standard. 

□ the computer readable form has not been furnished or does not comply with the standard.

V. Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 

1. Statement

Novelty (N)                     Yes: Claims 1-23 partially
                                No:  Claims

Inventive step (IS)             Yes: Claims
                                No:  Claims 1-23 partially

Industrial applicability (IA)   Yes: Claims 1-23 partially
                                No:  Claims


2. Citations and explanations 
see separate sheet 

VIII. Certain observations on the international application 

The following observations on the clarity of the claims, description, and drawings or on the question whether the 
claims are fully supported by the description, are made: 
see separate sheet

Re Item I

Basis of the report 

This International Preliminary Examination Report will be based on the subject- 
matter of claims 1-23 partially (see below). 

Re Item III 

Non-establishment of opinion with regard to novelty, inventive step and 
industrial applicability 

The Examining Division agrees with the objection put forward by the Search 
Division as to lack of unity (Rule 13.1 PCT). As the Applicant has not had a search 
report drawn up on the other invention (Rule 68.5 PCT), the application will be 
prosecuted on the basis of the invention in respect of which a search has already 
been carried out, in other words, the invention first mentioned in the claims. This 
includes the subject-matter of claims 1-23 partially. These claims are directed to 
isolated polynucleotide and polypeptide sequences encoding a plant protein 
(SNF1 protein kinase) involved in carbon catabolite repression, wherein the 
sequences are SEQ ID NOs: 1, 2, 23 and 24. The claims are further directed to
the methods of selection and uses of said sequences (see International Search 
Report). 

The Applicant's attention is drawn to the fact that claims, or parts of claims, 
relating to inventions in respect of which no international search report has been 
established need not be the subject of an international preliminary examination
(Rule 66.1(e) PCT). The Applicant is advised that the EPO policy when acting as
an International Preliminary Examining Authority is normally not to carry out a
preliminary examination on matter which has not been searched. This is the case
irrespective of whether or not the claims are amended following receipt of the
search report or during any Chapter II procedure. The applicant should, therefore,
limit the application to the invention searched and excise those parts of the 
application relating to the other inventions. 

Re Item V 

Reasoned statement under Article 35(2) with regard to novelty, inventive step or
industrial applicability; citations and explanations supporting such statement

The documents mentioned in this communication are numbered as in the search 
report, i.e. D1 corresponds to the first document of the search report. 

The invention of the present application is the provision of nucleic acid sequences 
encoding all or portions of a corn sucrose non-fermenting protein (SNF1). At the 
priority date of the present application, SNF1 was known to play a central role in 
carbon catabolite repression in yeast. It was identified by screening for genes 
involved in the regulation of invertase gene expression and is required for the 
expression of several glucose-repressible genes in response to glucose 
deprivation (see D5, page 8604). In yeast, the SNF1 gene encodes a 72 kDa 
serine and threonine protein kinase whose activity is down regulated by the 
availability of glucose. In addition, several plant SNF1-like genes had been
identified in the prior art. D1 discloses a wheat SNF1 protein kinase and its 
nucleotide and deduced amino acid sequence (D1, figure 1). The amino acid 
sequence of the protein kinase was compared to members of the SNF1 kinase 
subfamily. D2 and D3 disclose a rice SNF1-related protein kinase nucleotide and
amino acid sequence. The polypeptide sequence disclosed in D3 shows 87.1% 
identity in a 510 amino acid overlap to SEQ ID NO: 4 of the present application. 
D6, D7 and D8 disclose SNF1 protein kinase polynucleotide and polypeptide 
sequences from (respectively) sorghum, potato, and cucumber. D5 describes a 
rye SNF1 protein kinase polynucleotide and polypeptide sequence (see figure 1). 
The polypeptide sequence disclosed in D5 shows 77.9% identity in a 510 amino 
acid overlap to SEQ ID NO: 4 of the present application. Expression of the rye 
SNF1 cDNA in yeast snf1 mutants restored SNF1 function; therefore, it was
suggested that yeast and rye may have similar mechanisms for carbon catabolite 
repression (see abstract). 

The subject-matter of claims 1-23 has not been made available to the public by
any of the available prior art documents and can therefore be regarded as novel. 

In view of the prior art, the problem to be solved by the present invention may be 
regarded as the provision of a further sequence encoding a plant SNF1-like
protein. The solution, however, proposed in claims 1-23 of the present application
cannot be considered as involving an inventive step (Article 33(3) PCT) for the
following reasons. 

The polypeptide encoded by SEQ ID NO: 1 or 23 is characterized in structural 
terms but not in functional terms. Claim 1 is directed to a polypeptide 
characterized only by an amino acid sequence. Claim 2 is directed to a DNA
molecule characterized only by a nucleic acid sequence. The DNA or polypeptide
is thus structurally characterized but the claims include no other technical effect or
function for said DNA.

According to page 2 of the description of the present application, SEQ ID NOs: 1,
2, 23 and 24 are nucleic acid or amino acid sequences encoding a corn SNF1 
protein. This was established by computer-assisted sequence alignments in which 
said sequences were compared with sequences of proteins of the SNF1 family. In 
fact, the sequences claimed in the present application were identified by standard 
procedures well-known to a person skilled in the art. cDNA libraries were prepared
from various plant tissues and cDNA clones encoding SNF1-like proteins were 
identified by conducting BLAST. The cDNA sequences were compared to all 
publicly available DNA sequences and the nucleic acid fragments encoding 
probable SNF1 proteins were identified. From the sequence alignments with 
proteins of the SNF1 family it is concluded that, "these sequences represent the 
first corn, soybean and wheat sequences and a new rice sequence encoding 
SNF1 proteins" (present application, p. 21). At this stage, however, the protein is 
still only structurally defined. The comparison studies do not allow any concrete 
conclusions to be drawn concerning the actual function of the corn "SNF1", but
only speculations that need to be confirmed experimentally. 

If a nucleic acid or protein is only defined as being related or as having homology 
to another one, it has no definite function. Even though the corn SNF1 is defined 
in the present application as belonging to the SNF1 family, different members of 
any family can have very different functions, for example see pages 1-2 of the 
description. Consequently, such a definition cannot be considered to contain any 
technical information. Without any further indication of which of the many functions 
of the SNF family are intended for the claimed protein, the skilled person would be 
left guessing as to what the claimed protein should be used for. Examples 4-6 of
the present application merely describe methods of expressing the claimed cDNA
in various host cells. These examples provide no further information on the 
function or technical features of the claimed protein. 

If an invention should provide a solution to a problem with reference to the 
background art (Article 33(3) PCT), the invention of the present application is 
insufficiently disclosed and unclear (Article 6 PCT) since it is left to the reader to 
perform the invention and to determine what problem, if any, the isolated
nucleotide or polypeptide sequences actually solve. 

Furthermore, due to the fact that the claimed sequences are not associated with 
any known technical effect, the only problem which might be recognized is the 
mere provision of further DNA sequences as such, regardless of their possible
useful properties. In this case all known DNA sequences are equally suitable 
candidates for solving the above "technical problem" and would, therefore, all 
equally be suggested to the skilled person. The arbitrary selection from an infinite 
number of equally obvious possible solutions cannot involve an inventive step 
because, in order to be patentable, the selection must not be arbitrary but must be 
justified by the technical purpose, e.g. by a hitherto unknown or unexpected 
technical effect which is caused by those structural features distinguishing the 
claimed compounds from the numerous other ones. 

Furthermore, procedures for screening and cloning using EST sequences were 
routine at the priority date of the present application (16.12.98). Therefore, in 
general, the skilled person could be expected to perform the cloning of a gene 
with a reasonable expectation of success and without undue burden. 
Furthermore, procedures for performing sequence alignments and production of 
expression systems are all standard procedures well known to a person skilled in 
the art. Therefore, the IPEA is of the opinion that the polynucleotide or polypeptide 
sequences in question cannot be considered to fulfill the requirements of Article 6 
PCT with regard to clarity as well as the requirements of Article 33(3) PCT with 
respect to inventive step. 

Furthermore, in so far as the present specification is completely silent with regard 
to the function of the protein encoded by SEQ ID NOs: 1, 2, 23, 24, the possible
fields of application for the claimed protein and polynucleotides suggested in the
specification are only speculative. Thus it is questionable whether the claimed 
subject-matter actually meets the requirements set forth in Article 33(4) PCT. 

VIII. Certain observations on the international application 

1) Clarity: Article 6 PCT 

Article 6 PCT requires amongst other things that the claims, which define the 
matter for which protection is sought (i.e. the object of invention) be clear. This 
has to be interpreted as meaning not only that a claim from a technical point of 
view must be comprehensible, but also that it must define clearly the object of the 
invention, that is to say, it must indicate all the essential features thereof. In the 
present case, claim 1 refers to a polypeptide comprising a given sequence. An 
amino acid sequence, however, does not provide sufficient information to clearly 
characterize the claimed polypeptide. The polypeptide in question should be 
additionally characterized by name, source and function. Claim 2 at present 
covers DNA sequences which encode a polypeptide without any biological 
function. These claims should also include the function of said polypeptide. 
However, the attention of the applicant is directed to the fact that such a functional 
limitation must have a basis in the disclosure of the present application.

TITLE

PLANT CARBON CATABOLITE REPRESSION PROTEINS 
This application claims the benefit of U.S. Provisional Application No. 60/112,563,
filed December 16, 1998.

FIELD OF THE INVENTION

This invention is in the field of plant molecular biology. More specifically, this 
invention pertains to nucleic acid fragments encoding proteins involved in carbon catabolite 
repression in plants and seeds. 

BACKGROUND OF THE INVENTION 

Improvement of crop plants for a variety of traits, including disease and pest
resistance, and grain quality improvements such as oil, starch or protein composition are
major goals of plant breeders. Conventional plant breeding will continue to provide
improved crops; however, metabolic engineering through biotechnology may provide the
greatest potential for crop improvement.

The major storage compounds found in most plants are carbohydrates, lipids and
proteins, and the relative abundance and proportions of these compounds vary between
plants and in plant storage and vegetative organs. Lipids and proteins are the major storage
compounds in peanuts and soybeans, whereas carbohydrate is the predominant form in
cereals. Specific mechanisms control the allocation of photoassimilates into the carbon and
nitrogen metabolic pathways that produce these storage compounds; however, little is
known about the regulation of these pathways. Many of the plant genes that are involved in
photosynthesis and metabolic pathways appear to be regulated by catabolite repression
mechanisms (Sheen, J. (1990) Plant Cell 2:2107-2138). Catabolite repression is a general
mechanism utilized by prokaryotes and lower eukaryotes to regulate carbon and nitrogen
metabolism. Plant homologs of several proteins involved in catabolite regulation of various
metabolic pathways in bacteria and yeast suggest that similar regulatory mechanisms exist
in plants (Alderson, A. et al., (1991) PNAS 88:8602-8605 and Sano, H. et al., (1994) PNAS
91:2582-2586).

In yeast, carbon sensing and glucose repression are regulated by a cAMP-independent
pathway. One component of the glucose repression pathway is the sucrose non-fermenting
(SNF1) protein. SNF1 is a global regulatory protein that is required for the expression of
glucose-repressible genes. The SNF1 gene encodes a 72 kDa serine and threonine protein
kinase whose activity is down regulated by the availability of glucose (Celenza, J. L. et al.,
(1986) Science 233:1175-1180). The PHO85 gene of yeast encodes a glycogen synthase
kinase that also appears to be involved in the glucose repression pathway, possibly acting
downstream of SNF1. Huang et al. report that cells with disrupted PHO85 genes
accumulated large quantities of glycogen, had an active glycogen synthase and were
impaired in glycogen synthase kinase activity. In SNF1 mutant cells glycogen synthase
activity was elevated, suggesting that PHO85 and SNF1 act antagonistically to control
glycogen accumulation (Huang, D. et al., (1996) Mol Cell Biol 16(8):4357-4365).

Several plant SNF1-like genes have been identified, which suggests that these plant
genes may constitute a large family of protein kinases involved in carbon catabolite
regulation. One rye SNF1-like cDNA homolog, cRKIN1, has been shown to be capable
of rescuing SNF1 mutations in yeast cells, suggesting that yeast and rye may have similar
mechanisms of regulating carbon metabolism (Miao, G. H. (1996) in Plant Gene
Research: Basic Knowledge and Application, ed. by D. P. S. Verma, Springer-Verlag, New
York). Thus there is a great deal of interest in identifying the genes that encode proteins
involved in catabolite repression in plants. These genes may be used in plants to control
carbon and nitrogen partitioning pathways during plant growth and development.
Accordingly, the availability of nucleic acid sequences encoding all or a portion of SNF1
and PHO85 regulatory proteins would facilitate studies to better understand the mechanism
of catabolite repression in plants and could provide genetic tools to enhance or otherwise
alter the accumulation of carbohydrates, lipids and proteins during plant growth and
development.

SUMMARY OF THE INVENTION 
The present invention relates to isolated polynucleotides comprising a nucleotide
sequence encoding a polypeptide of at least 240 amino acids that has at least 90% identity
based on the Clustal method of alignment when compared to a polypeptide selected from the
group consisting of corn SNF1 polypeptides of SEQ ID NOs:2, 4 and 6, a rice SNF1
polypeptide of SEQ ID NO:8, soybean SNF1 polypeptides of SEQ ID NOs:10, 12, 14, 16
and 18, and wheat SNF1 polypeptides of SEQ ID NOs:20 and 22. The present invention also
relates to an isolated polynucleotide comprising the complement of the nucleotide sequences
described above.
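
As an illustration only, the identity and length thresholds recited above can be checked on a pair of sequences that have already been aligned. The sketch below is not the Clustal method itself: it assumes the alignment has been produced elsewhere, that gaps are written as "-", and that identity is counted over columns where both sequences have a residue (alignment programs may use a different denominator). The function names are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over aligned columns in which neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap or b == gap:
            continue  # skip gapped columns (an assumption, see lead-in)
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no overlapping residues to compare")
    return 100.0 * matches / compared


def meets_threshold(aligned_candidate: str, aligned_reference: str,
                    min_identity: float = 90.0, min_length: int = 240) -> bool:
    """True if the candidate is long enough and identical enough to the reference."""
    # Length is measured on the ungapped candidate polypeptide.
    candidate_length = len(aligned_candidate.replace("-", ""))
    if candidate_length < min_length:
        return False
    return percent_identity(aligned_candidate, aligned_reference) >= min_identity
```

The defaults mirror the 240-residue and 90% figures of the paragraph above; the 77-residue and 85% figures used for SEQ ID NOs:24, 26 and 28 would be passed as `min_length=77, min_identity=85.0`.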

The present invention also relates to isolated polynucleotides comprising a nucleotide
sequence encoding a polypeptide of at least 77 amino acids that has at least 85% identity
based on the Clustal method of alignment when compared to a polypeptide selected from the
group consisting of a corn SNF1 polypeptide of SEQ ID NO:24, a soybean SNF1
polypeptide of SEQ ID NO:26, and a wheat SNF1 polypeptide of SEQ ID NO:28. The
present invention also relates to an isolated polynucleotide comprising the complement of
the nucleotide sequences described above.

It is preferred that the isolated polynucleotides of the claimed invention consist of a
nucleic acid sequence selected from the group consisting of SEQ ID NOs:1, 3, 5, 7, 9, 11,
13, 15, 17, 19, 21, 23, 25 and 27 that codes for the polypeptide selected from the group
consisting of SEQ ID NOs:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26 and 28. The present
invention also relates to an isolated polynucleotide comprising a nucleotide sequence of at
least one of 60 (preferably at least one of 40, most preferably at least one of 30) contiguous
nucleotides derived from a nucleotide sequence selected from the group consisting of SEQ
ID NOs:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25 and 27 and the complement of such
nucleotide sequences.
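
The "contiguous nucleotides ... and the complement" language above can be pictured with a short sketch. It assumes plain A/C/G/T strings, treats "complement" as the reverse complement (the opposite strand read 5' to 3'), and uses hypothetical function names; it is an illustration, not a statement of how the claim would be construed.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the opposite strand read 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def shares_contiguous_run(candidate: str, reference: str, min_run: int = 30) -> bool:
    """True if the candidate contains min_run contiguous nucleotides that occur
    in the reference or in its reverse complement."""
    candidate = candidate.upper()
    reference = reference.upper()
    targets = (reference, reverse_complement(reference))
    if len(candidate) < min_run:
        return False
    for start in range(len(candidate) - min_run + 1):
        window = candidate[start:start + min_run]
        if any(window in target for target in targets):
            return True
    return False
```

With `min_run` set to 60, 40 or 30, the same check covers each of the lengths recited above.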

The present invention relates to a chimeric gene comprising an isolated polynucleotide
of the present invention operably linked to suitable regulatory sequences.

The present invention relates to an isolated host cell comprising a chimeric gene of the
present invention or an isolated polynucleotide of the present invention. The host cell may
be eukaryotic, such as a yeast or a plant cell, or prokaryotic, such as a bacterial cell. The
present invention also relates to a virus, preferably a baculovirus, comprising an isolated
polynucleotide of the present invention or a chimeric gene of the present invention.

The present invention relates to a process for producing an isolated host cell
comprising a chimeric gene of the present invention or an isolated polynucleotide of the
present invention, the process comprising either transforming or transfecting an isolated
compatible host cell with a chimeric gene or isolated polynucleotide of the present invention.

The present invention relates to a SNF1 polypeptide of at least 240 amino acids
comprising at least 90% homology based on the Clustal method of alignment compared to a
polypeptide selected from the group consisting of SEQ ID NOs:2, 4, 6, 8, 10, 12, 14, 16, 18,
20 and 22.

The present invention relates to a SNF1 polypeptide of at least 77 amino acids
comprising at least 85% homology based on the Clustal method of alignment compared to a
polypeptide selected from the group consisting of SEQ ID NOs:24, 26 and 28.

The present invention relates to a method of selecting an isolated polynucleotide that
affects the level of expression of a SNF1 polypeptide in a host cell, preferably a plant cell,
the method comprising the steps of: (a) constructing an isolated polynucleotide of the
present invention or an isolated chimeric gene of the present invention; (b) introducing the
isolated polynucleotide or the isolated chimeric gene into a host cell; (c) measuring the level
of a SNF1 polypeptide in the host cell containing the isolated polynucleotide; and
(d) comparing the level of a SNF1 polypeptide in the host cell containing the isolated
polynucleotide with the level of a SNF1 polypeptide in the host cell that does not contain the
isolated polynucleotide.

The present invention relates to a method of obtaining a nucleic acid fragment
encoding a substantial portion of a SNF1 polypeptide gene, preferably a plant SNF1
polypeptide gene, comprising the steps of: synthesizing an oligonucleotide primer
comprising a nucleotide sequence of at least one of 60 (preferably at least one of 40, most
preferably at least one of 30) contiguous nucleotides derived from a nucleotide sequence
selected from the group consisting of SEQ ID NOs:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25
and 27 and the complement of such nucleotide sequences; and amplifying a nucleic acid
fragment (preferably a cDNA inserted in a cloning vector) using the oligonucleotide primer.
The amplified nucleic acid fragment preferably will encode a portion of a SNF1 amino acid
sequence.

The present invention also relates to a method of obtaining a nucleic acid fragment
encoding all or a substantial portion of the amino acid sequence encoding a SNF1
polypeptide comprising the steps of: probing a cDNA or genomic library with an isolated
polynucleotide of the present invention; identifying a DNA clone that hybridizes with an
isolated polynucleotide of the present invention; isolating the identified DNA clone; and
sequencing the cDNA or genomic fragment that comprises the isolated DNA clone.

The present invention relates to a composition, such as a hybridization mixture,
comprising an isolated polynucleotide of the present invention.

The present invention relates to an isolated polynucleotide of the present invention
comprising at least one of 30 contiguous nucleotides derived from a nucleic acid sequence
selected from the group consisting of SEQ ID NOs:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25
and 27.

The present invention relates to an expression cassette comprising an isolated
polynucleotide of the present invention operably linked to a promoter.

The present invention relates to a method for positive selection of a transformed cell
comprising: (a) transforming a host cell with the chimeric gene of the present invention or
an expression cassette of the present invention; and (b) growing the transformed host cell,
preferably plant cell, such as a monocot or a dicot, under conditions which allow expression
of the SNF1 polynucleotide in an amount sufficient to complement a null mutant and/or alter
expression of glucose repressible genes to provide a positive selection means.

BRIEF DESCRIPTION OF THE SEQUENCE DESCRIPTIONS
The invention can be more fully understood from the following detailed description
and the accompanying Sequence Listing which form a part of this application.

Table 1 lists the polypeptides that are described herein, the designation of the
cDNA clones that comprise the nucleic acid fragments encoding polypeptides representing
all or a substantial portion of these polypeptides, and the corresponding identifier (SEQ ID
NO:) as used in the attached Sequence Listing. Table 1 also identifies the cDNA clones as
individual ESTs ("EST"), the sequences of the entire cDNA inserts comprising the
indicated cDNA clones ("FIS"), contigs assembled from two or more ESTs ("Contig"),
contigs assembled from an FIS and one or more ESTs ("Contig*"), or sequences encoding
the entire protein derived from an FIS, a contig, or an FIS and PCR ("CGS"). Nucleotide
sequences, SEQ ID NOs:1, 17 and 21 and amino acid sequences SEQ ID NOs:2, 18 and
22 were determined by further sequence analysis of cDNA clones encoding the amino acid
sequences set forth in SEQ ID NOs:24, 26 and 28. Nucleotide SEQ ID NOs:23, 25 and 27
and amino acid SEQ ID NOs:24, 26 and 28 were presented in U.S. Provisional
Application No. 60/112,563, filed December 16, 1998.

The sequence descriptions and Sequence Listing attached hereto comply with the rules
governing nucleotide and/or amino acid sequence disclosures in patent applications as set
forth in 37 C.F.R. §1.821-1.825.



TABLE 1 

Proteins Involved In Carbon Catabolite Repression 

                                                     SEQ ID NO: 
Protein   Clone Designation               (Nucleotide)   (Amino Acid) 
SNF1      cen3n.pk0044.b8 (CGS)                 1              2 
SNF1      rda.pk0007.g3                         3              4 
SNF1      rr1.pk0008.e12                        5              6 
SNF1      m.pKUU4/.glZ                          7              8 
SNF1      sdp4c.pk016.e10 (CGS)                 9             10 
SNF1      sdr1f.pk001.p7 (CGS)                 11             12 
SNF1      sgs4c.pk006.g6 (CGS)                 13             14 
SNF1      sgs4c.pk006.n21 (CGS)                15             16 
SNF1      srr1c.pk001.i24 (CGS)                17             18 
SNF1      Contig composed of:                  19             20 
          wdk2c.pk018.c16 
          wlm1.pk0027.a12 
SNF1      wlm96.pk0007.e4 (CGS)                21             22 
          cen3n.pk0044.b8 (EST)                23             24 
          srr1c.pk001.i24 (EST)                25             26 
          wlm96.pk0007.e4 (EST)                27             28 



The Sequence Listing contains the one letter code for nucleotide sequence characters 
and the three letter codes for amino acids as defined in conformity with the IUPAC-IUBMB 
standards described in Nucleic Acids Res. 13:3021-3030 (1985) and in the Biochemical J. 
219 (No. 2):345-373 (1984) which are herein incorporated by reference. The symbols and 
format used for nucleotide and amino acid sequence data comply with the rules set forth in 
37 C.F.R. §1.822. 

DETAILED DESCRIPTION OF THE INVENTION 
In the context of this disclosure, a number of terms shall be utilized. As used herein, a 
"polynucleotide" is a nucleotide sequence such as a nucleic acid fragment. A polynucleotide 
may be a polymer of RNA or DNA that is single- or double-stranded, that optionally 
contains synthetic, non-natural or altered nucleotide bases. A polynucleotide in the form of 
a polymer of DNA may be comprised of one or more segments of cDNA, genomic DNA, 
synthetic DNA, or mixtures thereof. An isolated polynucleotide of the present invention 
may include at least one of 60 contiguous nucleotides, preferably at least one of 40 
contiguous nucleotides, most preferably one of at least 30 contiguous nucleotides derived 
from SEQ ID NOs:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27 or the complement of such 
sequences. 
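The contiguous-nucleotide criterion above can be illustrated with a short sketch. It assumes plain A/C/G/T strings and treats "the complement" as the reverse complement written 5' to 3'; the function names and the default window of 30 are illustrative choices, not part of the disclosure.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    pairs = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(pairs[base] for base in reversed(seq.upper()))


def shares_contiguous_run(candidate: str, reference: str, min_len: int = 30) -> bool:
    """True if candidate contains any min_len-long window of the reference
    or of the reference's reverse complement."""
    candidate = candidate.upper()
    for target in (reference.upper(), reverse_complement(reference)):
        # Slide a window of min_len across the target strand.
        for start in range(len(target) - min_len + 1):
            if target[start:start + min_len] in candidate:
                return True
    return False
```

A reference shorter than `min_len` yields no windows, so the function returns False for it.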

As used herein, "contig" refers to a nucleotide sequence that is assembled from two or 
more constituent nucleotide sequences that share common or overlapping regions of 
sequence homology. For example, the nucleotide sequences of two or more nucleic acid 
fragments can be compared and aligned in order to identify common or overlapping 
sequences. Where common or overlapping sequences exist between two or more nucleic 
acid fragments, the sequences (and thus their corresponding nucleic acid fragments) can be 
assembled into a single contiguous nucleotide sequence. 
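As a minimal sketch of the assembly idea described above, the following joins two sequences when the end of one exactly matches the start of the other. Real assemblers tolerate mismatches and handle both strands; the exact-match rule, the function name and the `min_overlap` default are simplifying assumptions made only for illustration.

```python
from typing import Optional


def merge_overlapping(left: str, right: str, min_overlap: int = 20) -> Optional[str]:
    """Join left and right where a suffix of left exactly matches a prefix of right.

    Returns the merged sequence, or None if no exact overlap of at least
    min_overlap characters exists.
    """
    if min_overlap < 1:
        raise ValueError("min_overlap must be at least 1")
    longest = min(len(left), len(right))
    # Prefer the longest overlap, so try the largest size first.
    for size in range(longest, min_overlap - 1, -1):
        if left[-size:] == right[:size]:
            return left + right[size:]
    return None
```

For example, `merge_overlapping("ACGTACGGA", "CGGATTT", min_overlap=4)` joins the two fragments through their shared `CGGA`.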

As used herein, "substantially similar" refers to nucleic acid fragments wherein 
changes in one or more nucleotide bases result in substitution of one or more amino acids, 
but do not affect the functional properties of the polypeptide encoded by the nucleotide 
sequence. "Substantially similar" also refers to nucleic acid fragments wherein changes in 
one or more nucleotide bases do not affect the ability of the nucleic acid fragment to 
mediate alteration of gene expression by gene silencing through, for example, antisense or 
co-suppression technology. "Substantially similar" also refers to modifications of the nucleic 
acid fragments of the instant invention such as deletion or insertion of one or more 
nucleotides that do not substantially affect the functional properties of the resulting 
transcript vis-a-vis the ability to mediate gene silencing or alteration of the functional 
properties of the resulting protein molecule. It is therefore understood that the invention 
encompasses more than the specific exemplary nucleotide or amino acid sequences and 
includes functional equivalents thereof. 

Substantially similar nucleic acid fragments may be selected by screening nucleic acid 
fragments representing subfragments or modifications of the nucleic acid fragments of the 
instant invention, wherein one or more nucleotides are substituted, deleted and/or inserted, 
for their ability to affect the level of the polypeptide encoded by the unmodified nucleic acid 
fragment in a plant or plant cell. For example, a substantially similar nucleic acid fragment 
representing at least one of 30 contiguous nucleotides derived from the instant nucleic acid 
fragment can be constructed and introduced into a plant or plant cell. The level of the 
polypeptide encoded by the unmodified nucleic acid fragment present in a plant or plant cell 
exposed to the substantially similar nucleic acid fragment can then be compared to the level of 
the polypeptide in a plant or plant cell that is not exposed to the substantially similar nucleic 
acid fragment. 

For example, it is well known in the art that antisense suppression and co-suppression 
of gene expression may be accomplished using nucleic acid fragments representing less than 
the entire coding region of a gene, and by nucleic acid fragments that do not share 100% 
sequence identity with the gene to be suppressed. Moreover, alterations in a nucleic acid 
fragment which result in the production of a chemically equivalent amino acid at a given 
site, but do not affect the functional properties of the encoded polypeptide, are well known in 
the art. Thus, a codon for the amino acid alanine, a hydrophobic amino acid, may be 
substituted by a codon encoding another less hydrophobic residue, such as glycine, or a more 
hydrophobic residue, such as valine, leucine, or isoleucine. Similarly, changes which result 
in substitution of one negatively charged residue for another, such as aspartic acid for 
glutamic acid, or one positively charged residue for another, such as lysine for arginine, can 
also be expected to produce a functionally equivalent product. Nucleotide changes which 
result in alteration of the N-terminal and C-terminal portions of the polypeptide molecule 
would also not be expected to alter the activity of the polypeptide. Each of the proposed 
modifications is well within the routine skill in the art, as is determination of retention of 
biological activity of the encoded products. Consequently, an isolated polynucleotide 
comprising a nucleotide sequence of at least one of 60 (preferably at least one of 40, most 
preferably at least one of 30) contiguous nucleotides derived from a nucleotide sequence 
selected from the group consisting of SEQ ID NOs:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 
25, 27 and the complement of such nucleotide sequences may be used in methods of 
selecting an isolated polynucleotide that affects the expression of a polypeptide (such as 
SNF1) in a host cell. A method of selecting an isolated polynucleotide that affects the level 
of expression of a polypeptide (such as SNF1) in a host cell (eukaryotic, such as plant or 
yeast, prokaryotic such as bacterial, or viral) may comprise the steps of: constructing an 
isolated polynucleotide of the present invention or an isolated chimeric gene of the present 
invention; introducing the isolated polynucleotide or the isolated chimeric gene into a host 
cell; measuring the level of a polypeptide in the host cell containing the isolated 
polynucleotide; and comparing the level of a polypeptide in the host cell containing the 
isolated polynucleotide with the level of a polypeptide in a host cell that does not contain the 
isolated polynucleotide. 

Moreover, substantially similar nucleic acid fragments may also be characterized by 
their ability to hybridize. Estimates of such homology are provided by either DNA-DNA or 
DNA-RNA hybridization under conditions of stringency as is well understood by those 
skilled in the art (Hames and Higgins, Eds. (1985) Nucleic Acid Hybridisation, IRL Press, 
Oxford, U.K.). Stringency conditions can be adjusted to screen for moderately similar 
fragments, such as homologous sequences from distantly related organisms, to highly similar 
fragments, such as genes that duplicate functional enzymes from closely related organisms. 
Post-hybridization washes determine stringency conditions. One set of preferred conditions 
uses a series of washes starting with 6X SSC, 0.5% SDS at room temperature for 15 min, 
then repeated with 2X SSC, 0.5% SDS at 45°C for 30 min, and then repeated twice with 
0.2X SSC, 0.5% SDS at 50°C for 30 min. A more preferred set of stringent conditions uses 
higher temperatures in which the washes are identical to those above except that the 
temperature of the final two 30 min washes in 0.2X SSC, 0.5% SDS is increased to 60°C. 
Another preferred set of highly stringent conditions uses two final washes in 0.1X SSC, 
0.1% SDS at 65°C. 

Substantially similar nucleic acid fragments of the instant invention may also be 
characterized by the percent identity of the amino acid sequences that they encode to the 
amino acid sequences disclosed herein, as determined by algorithms commonly employed by 
those skilled in this art. Suitable nucleic acid fragments (isolated polynucleotides of the 
present invention) encode polypeptides that are at least about 70% identical, preferably at 
least about 80% identical to the amino acid sequences reported herein. Preferred nucleic 
acid fragments encode amino acid sequences that are about 85% identical to the amino acid 
sequences reported herein. More preferred nucleic acid fragments encode amino acid 
sequences that are at least about 90% identical to the amino acid sequences reported herein. 
Most preferred are nucleic acid fragments that encode amino acid sequences that are at least 
about 95% identical to the amino acid sequences reported herein. Suitable nucleic acid 
fragments not only have the above homologies but typically encode a polypeptide having at 
least about 50 amino acids, preferably at least about 100 amino acids, more preferably at 
least about 150 amino acids, still more preferably at least about 200 amino acids, and most 
preferably at least about 250 amino acids. Sequence alignments and percent identity 
calculations were performed using the Megalign program of the LASERGENE 
bioinformatics computing suite (DNASTAR Inc., Madison, WI). Multiple alignment of the 
sequences was performed using the Clustal method of alignment (Higgins and Sharp (1989) 
CABIOS. 5:151-153) with the default parameters (GAP PENALTY=10, GAP LENGTH 
PENALTY=10). Default parameters for pairwise alignments using the Clustal method were 
KTUPLE 1, GAP PENALTY=3, WINDOW=5 and DIAGONALS SAVED=5. 
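For orientation only, percent identity between two already-aligned sequences can be computed as in the sketch below. This is not the Megalign or Clustal implementation. It assumes gaps are written as "-" and that columns containing a gap are excluded from the denominator, which is one of several conventions in use.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of identical residues over the gap-free columns of a pairwise alignment."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == gap or res_b == gap:
            continue  # assumption: gapped columns are not counted
        compared += 1
        if res_a == res_b:
            matches += 1
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared
```

Under this convention, a fragment would meet the "at least about 90% identical" tier when `percent_identity(...)` returns 90.0 or more for its encoded polypeptide.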

A "substantial portion" of an amino acid or nucleotide sequence comprises an amino 
acid or a nucleotide sequence that is sufficient to afford putative identification of the protein 
or gene that the amino acid or nucleotide sequence comprises. Amino acid and nucleotide 
sequences can be evaluated either manually by one skilled in the art, or by using 
computer-based sequence comparison and identification tools that employ algorithms such as BLAST 
(Basic Local Alignment Search Tool; Altschul et al. (1993) J. Mol. Biol. 215:403-410; see 
also www.ncbi.nlm.nih.gov/BLAST/). In general, a sequence of ten or more contiguous 
amino acids or thirty or more contiguous nucleotides is necessary in order to putatively 
identify a polypeptide or nucleic acid sequence as homologous to a known protein or gene. 
Moreover, with respect to nucleotide sequences, gene-specific oligonucleotide probes 
comprising 30 or more contiguous nucleotides may be used in sequence-dependent methods 
of gene identification (e.g., Southern hybridization) and isolation (e.g., in situ hybridization 
of bacterial colonies or bacteriophage plaques). In addition, short oligonucleotides of 12 or 
more nucleotides may be used as amplification primers in PCR in order to obtain a particular 
nucleic acid fragment comprising the primers. Accordingly, a "substantial portion" of a 
nucleotide sequence comprises a nucleotide sequence that will afford specific identification 
and/or isolation of a nucleic acid fragment comprising the sequence. The instant 
specification teaches amino acid and nucleotide sequences encoding polypeptides that 
comprise one or more particular plant proteins. The skilled artisan, having the benefit of the 
sequences as reported herein, may now use all or a substantial portion of the disclosed 
sequences for purposes known to those skilled in this art. Accordingly, the instant invention 
comprises the complete sequences as reported in the accompanying Sequence Listing, as 
well as substantial portions of those sequences as defined above. 

"Codon degeneracy" refers to divergence in the genetic code permitting variation of 
the nucleotide sequence without affecting the amino acid sequence of an encoded 
polypeptide. Accordingly, the instant invention relates to any nucleic acid fragment 
comprising a nucleotide sequence that encodes all or a substantial portion of the amino acid 
sequences set forth herein. The skilled artisan is well aware of the "codon-bias" exhibited 
by a specific host cell in usage of nucleotide codons to specify a given amino acid. 
Therefore, when synthesizing a nucleic acid fragment for improved expression in a host cell, 
it is desirable to design the nucleic acid fragment such that its frequency of codon usage 
approaches the frequency of preferred codon usage of the host cell. 
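A survey of host codon usage of the kind referred to above might be tabulated as in the following sketch. It assumes the input strings are in-frame coding sequences and reports each codon as a fraction of all codons counted. Grouping codons by the amino acid they encode, which a fuller analysis would do, is left out for brevity, and the function name is illustrative.

```python
from collections import Counter
from typing import Dict, Iterable


def codon_usage(coding_sequences: Iterable[str]) -> Dict[str, float]:
    """Return {codon: fraction of all codons} pooled over in-frame coding sequences."""
    counts: Counter = Counter()
    for seq in coding_sequences:
        seq = seq.upper()
        usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
        for i in range(0, usable, 3):
            counts[seq[i:i + 3]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {codon: n / total for codon, n in counts.items()}
```

Comparing such a table for a candidate fragment with the table for a set of host genes gives a rough indication of how closely the fragment's codon usage approaches that of the host.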

"Synthetic nucleic acid fragments" can be assembled from oligonucleotide building 
blocks that are chemically synthesized using procedures known to those skilled in the art. 
These building blocks are ligated and annealed to form larger nucleic acid fragments which 
may then be enzymatically assembled to construct the entire desired nucleic acid fragment. 
"Chemically synthesized", as related to a nucleic acid fragment, means that the component 
nucleotides were assembled in vitro. Manual chemical synthesis of nucleic acid fragments 
may be accomplished using well established procedures, or automated chemical synthesis 
can be performed using one of a number of commercially available machines. Accordingly, 
the nucleic acid fragments can be tailored for optimal gene expression based on optimization 
of nucleotide sequence to reflect the codon bias of the host cell. The skilled artisan 
appreciates the likelihood of successful gene expression if codon usage is biased towards 
those codons favored by the host. Determination of preferred codons can be based on a 
survey of genes derived from the host cell where sequence information is available. 

"Gene" refers to a nucleic acid fragment that expresses a specific protein, including 
regulatory sequences preceding (5' non-coding sequences) and following (3' non-coding 
sequences) the coding sequence. "Native gene" refers to a gene as found in nature with its 
own regulatory sequences. "Chimeric gene" refers to any gene that is not a native gene, 
comprising regulatory and coding sequences that are not found together in nature. 
Accordingly, a chimeric gene may comprise regulatory sequences and coding sequences that 
are derived from different sources, or regulatory sequences and coding sequences derived 
from the same source, but arranged in a manner different than that found in nature. 
"Endogenous gene" refers to a native gene in its natural location in the genome of an 
organism. A "foreign" gene refers to a gene not normally found in the host organism, but 
that is introduced into the host organism by gene transfer. Foreign genes can comprise 
native genes inserted into a non-native organism, or chimeric genes. A "transgene" is a gene 
that has been introduced into the genome by a transformation procedure. 

"Coding sequence" refers to a nucleotide sequence that codes for a specific amino acid 
sequence. "Regulatory sequences" refer to nucleotide sequences located upstream (5' 
non-coding sequences), within, or downstream (3' non-coding sequences) of a coding sequence, 
and which influence the transcription, RNA processing or stability, or translation of the 
associated coding sequence. Regulatory sequences may include promoters, translation 
leader sequences, introns, and polyadenylation recognition sequences. 

"Promoter" refers to a nucleotide sequence capable of controlling the expression of a 
coding sequence or functional RNA. In general, a coding sequence is located 3' to a 
promoter sequence. The promoter sequence consists of proximal and more distal upstream 
elements, the latter elements often referred to as enhancers. Accordingly, an "enhancer" is a 
nucleotide sequence which can stimulate promoter activity and may be an innate element of 
the promoter or a heterologous element inserted to enhance the level or tissue-specificity of 
a promoter. Promoters may be derived in their entirety from a native gene, or be composed 
of different elements derived from different promoters found in nature, or even comprise 
synthetic nucleotide segments. It is understood by those skilled in the art that different 
promoters may direct the expression of a gene in different tissues or cell types, or at 
different stages of development, or in response to different environmental conditions. 
Promoters which cause a nucleic acid fragment to be expressed in most cell types at most 
times are commonly referred to as "constitutive promoters". New promoters of various 
types useful in plant cells are constantly being discovered; numerous examples may be 
found in the compilation by Okamuro and Goldberg (1989) Biochemistry of Plants 15:1-82. 
It is further recognized that since in most cases the exact boundaries of regulatory sequences 
have not been completely defined, nucleic acid fragments of different lengths may have 
identical promoter activity. 

The "translation leader sequence" refers to a nucleotide sequence located between the 
promoter sequence of a gene and the coding sequence. The translation leader sequence is 
present in the fully processed mRNA upstream of the translation start sequence. The 
translation leader sequence may affect processing of the primary transcript to mRNA, 
mRNA stability or translation efficiency. Examples of translation leader sequences have 
been described (Turner and Foster (1995) Mol. Biotechnol. 3:225-236). 

The "3' non-coding sequences" refer to nucleotide sequences located downstream of a 
coding sequence and include polyadenylation recognition sequences and other sequences 
encoding regulatory signals capable of affecting mRNA processing or gene expression. The 
polyadenylation signal is usually characterized by affecting the addition of polyadenylic acid 
tracts to the 3' end of the mRNA precursor. The use of different 3' non-coding sequences is 
exemplified by Ingelbrecht et al. (1989) Plant Cell 1:671-680. 

"RNA transcript" refers to the product resulting from RNA polymerase-catalyzed 
transcription of a DNA sequence. When the RNA transcript is a perfect complementary 
copy of the DNA sequence, it is referred to as the primary transcript or it may be an RNA 
sequence derived from posttranscriptional processing of the primary transcript and is 
referred to as the mature RNA. "Messenger RNA (mRNA)" refers to the RNA that is 
without introns and that can be translated into polypeptide by the cell. "cDNA" refers to a 
double-stranded DNA that is complementary to and derived from mRNA. "Sense" RNA 
refers to an RNA transcript that includes the mRNA and so can be translated into a 
polypeptide by the cell. "Antisense RNA" refers to an RNA transcript that is 
complementary to all or part of a target primary transcript or mRNA and that blocks the 
expression of a target gene (see U.S. Patent No. 5,107,065, incorporated herein by 
reference). The complementarity of an antisense RNA may be with any part of the specific 
nucleotide sequence, i.e., at the 5' non-coding sequence, 3' non-coding sequence, introns, or 
the coding sequence. "Functional RNA" refers to sense RNA, antisense RNA, ribozyme 
RNA, or other RNA that may not be translated but yet has an effect on cellular processes. 

The term "operably linked" refers to the association of two or more nucleic acid 
fragments on a single nucleic acid fragment so that the function of one is affected by the 
other. For example, a promoter is operably linked with a coding sequence when it is capable 
of affecting the expression of that coding sequence (i.e., that the coding sequence is under 
the transcriptional control of the promoter). Coding sequences can be operably linked to 
regulatory sequences in sense or antisense orientation. 

The term "expression", as used herein, refers to the transcription and stable 
accumulation of sense (mRNA) or antisense RNA derived from the nucleic acid fragment of 
the invention. Expression may also refer to translation of mRNA into a polypeptide. 
"Antisense inhibition" refers to the production of antisense RNA transcripts capable of 
suppressing the expression of the target protein. "Overexpression" refers to the production 
of a gene product in transgenic organisms that exceeds levels of production in normal or 
non-transformed organisms. "Co-suppression" refers to the production of sense RNA 
transcripts capable of suppressing the expression of identical or substantially similar foreign 
or endogenous genes (U.S. Patent No. 5,231,020, incorporated herein by reference). 

"Altered levels" refers to the production of gene product(s) in transgenic organisms in 
amounts or proportions that differ from that of normal or non-transformed organisms. 

"Mature" protein refers to a post-translationally processed polypeptide; i.e., one from 
which any pre- or propeptides present in the primary translation product have been removed. 
"Precursor" protein refers to the primary product of translation of mRNA; i.e., with pre- and 
propeptides still present. Pre- and propeptides may be but are not limited to intracellular 
localization signals. 

A "chloroplast transit peptide" is an amino acid sequence which is translated in 
conjunction with a protein and directs the protein to the chloroplast or other plastid types 
present in the cell in which the protein is made. "Chloroplast transit sequence" refers to a 
nucleotide sequence that encodes a chloroplast transit peptide. A "signal peptide" is an 
amino acid sequence which is translated in conjunction with a protein and directs the protein 
to the secretory system (Chrispeels (1991) Ann. Rev. Plant Phys. Plant Mol. Biol. 42:21-53). 
If the protein is to be directed to a vacuole, a vacuolar targeting signal (supra) can further be 
added, or if to the endoplasmic reticulum, an endoplasmic reticulum retention signal (supra) 
may be added. If the protein is to be directed to the nucleus, any signal peptide present 
should be removed and instead a nuclear localization signal included (Raikhel (1992) Plant 
Phys. 100:1627-1632). 

"Transformation" refers to the transfer of a nucleic acid fragment into the genome of a 
host organism, resulting in genetically stable inheritance. Host organisms containing the 
transformed nucleic acid fragments are referred to as "transgenic" organisms. Examples of 
methods of plant transformation include Agrobacterium-mediated transformation (De Blaere 
et al. (1987) Meth. Enzymol. 143:277) and particle-accelerated or "gene gun" transformation 
technology (Klein et al. (1987) Nature (London) 327:70-73; U.S. Patent No. 4,945,050, 
incorporated herein by reference). 

Standard recombinant DNA and molecular cloning techniques used herein are well 
known in the art and are described more fully in Sambrook et al. Molecular Cloning: A 
Laboratory Manual; Cold Spring Harbor Laboratory Press: Cold Spring Harbor, 1989 
(hereinafter "Maniatis"). 

Nucleic acid fragments encoding at least a portion of several proteins involved in 
carbon catabolite repression have been isolated and identified by comparison of random 
plant cDNA sequences to public databases containing nucleotide and protein sequences 
using the BLAST algorithms well known to those skilled in the art. The nucleic acid 
fragments of the instant invention may be used to isolate cDNAs and genes encoding 
homologous proteins from the same or other plant species. Isolation of homologous genes 
using sequence-dependent protocols is well known in the art. Examples of 
sequence-dependent protocols include, but are not limited to, methods of nucleic acid hybridization, 
and methods of DNA and RNA amplification as exemplified by various uses of nucleic acid 
amplification technologies (e.g., polymerase chain reaction, ligase chain reaction). 

For example, genes encoding other SNF1 polypeptides, either as cDNAs or genomic 
DNAs, could be isolated directly by using all or a portion of the instant nucleic acid 
fragments as DNA hybridization probes to screen libraries from any desired plant employing 
methodology well known to those skilled in the art. Specific oligonucleotide probes based 
upon the instant nucleic acid sequences can be designed and synthesized by methods known 
in the art (Maniatis). Moreover, the entire sequences can be used directly to synthesize 
DNA probes by methods known to the skilled artisan such as random primer DNA labeling, 
nick translation, or end-labeling techniques, or RNA probes using available in vitro 
transcription systems. In addition, specific primers can be designed and used to amplify a 
part or all of the instant sequences. The resulting amplification products can be labeled 
directly during amplification reactions or labeled after amplification reactions, and used as 
probes to isolate full length cDNA or genomic fragments under conditions of appropriate 
stringency. 

In addition, two short segments of the instant nucleic acid fragments may be used in 
polymerase chain reaction protocols to amplify longer nucleic acid fragments encoding 
homologous genes from DNA or RNA. The polymerase chain reaction may also be 
performed on a library of cloned nucleic acid fragments wherein the sequence of one primer 
is derived from the instant nucleic acid fragments, and the sequence of the other primer takes 
advantage of the presence of the polyadenylic acid tracts to the 3' end of the mRNA 
precursor encoding plant genes. Alternatively, the second primer sequence may be based 
upon sequences derived from the cloning vector. For example, the skilled artisan can follow 
the RACE protocol (Frohman et al. (1988) Proc. Natl. Acad. Sci. USA 85:8998-9002) to 
generate cDNAs by using PCR to amplify copies of the region between a single point in the 
transcript and the 3' or 5' end. Primers oriented in the 3' and 5' directions can be designed 
from the instant sequences. Using commercially available 3' RACE or 5' RACE systems 
(BRL), specific 3' or 5' cDNA fragments can be isolated (Ohara et al. (1989) Proc. Natl. 
Acad. Sci. USA 86:5673-5677; Loh et al. (1989) Science 243:217-220). Products generated 
by the 3' and 5' RACE procedures can be combined to generate full-length cDNAs (Frohman 
and Martin (1989) Techniques 1:165). Consequently, a polynucleotide comprising a 
nucleotide sequence of at least one of 60 (preferably one of at least 40, most preferably one 
of at least 30) contiguous nucleotides derived from a nucleotide sequence selected from the 
group consisting of SEQ ID NOs:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27 and the 
complement of such nucleotide sequences may be used in such methods to obtain a nucleic 
acid fragment encoding a substantial portion of an amino acid sequence of a polypeptide. 

The present invention relates to a method of obtaining a nucleic acid fragment encoding a 
substantial portion of a polypeptide of a gene (such as SNF1), preferably a substantial portion 
of a plant polypeptide of a gene, comprising the steps of: synthesizing an oligonucleotide 
primer comprising a nucleotide sequence of at least one of 60 (preferably at least one of 40, 
most preferably at least one of 30) contiguous nucleotides derived from a nucleotide 
sequence selected from the group consisting of SEQ ID NOs:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 
21, 23, 25, 27 and the complement of such nucleotide sequences; and amplifying a nucleic 
acid fragment (preferably a cDNA inserted in a cloning vector) using the oligonucleotide 
primer. The amplified nucleic acid fragment preferably will encode a portion of a 
polypeptide (SNF1). 

Availability of the instant nucleotide and deduced amino acid sequences facilitates 
immunological screening of cDNA expression libraries. Synthetic peptides representing 
portions of the instant amino acid sequences may be synthesized. These peptides can be 
used to immunize animals to produce polyclonal or monoclonal antibodies with specificity 
for peptides or proteins comprising the amino acid sequences. These antibodies can then 
be used to screen cDNA expression libraries to isolate full-length cDNA clones of interest 
(Lerner (1984) Adv. Immunol. 36:1-34; Maniatis). 

The nucleic acid fragments of the instant invention may be used to create transgenic 
plants in which the disclosed polypeptides are present at higher or lower levels than normal 
or in cell types or developmental stages in which they are not normally found. This would 
have the effect of altering the level of catabolite repression in those cells. 

Overexpression of the proteins of the instant invention may be accomplished by first 
constructing a chimeric gene in which the coding region is operably linked to a promoter 
capable of directing expression of a gene in the desired tissues at the desired stage of 
development. For reasons of convenience, the chimeric gene may comprise promoter 
sequences and translation leader sequences derived from the same genes. 3' non-coding 
sequences encoding transcription termination signals may also be provided. The instant 
chimeric gene may also comprise one or more introns in order to facilitate gene expression. 

Plasmid vectors comprising the instant chimeric gene can then be constructed. The 
choice of plasmid vector is dependent upon the method that will be used to transform host 
plants. The skilled artisan is well aware of the genetic elements that must be present on the 
plasmid vector in order to successfully transform, select and propagate host cells containing 
the chimeric gene. The skilled artisan will also recognize that different independent 
transformation events will result in different levels and patterns of expression (Jones et al. 
(1985) EMBO J. 4:2411-2418; De Almeida et al. (1989) Mol. Gen. Genetics 218:78-86), and 
thus that multiple events must be screened in order to obtain lines displaying the desired 
expression level and pattern. Such screening may be accomplished by Southern analysis of 
DNA, Northern analysis of mRNA expression, Western analysis of protein expression, or 
phenotypic analysis. 

For some applications it may be useful to direct the instant polypeptides to different
cellular compartments, or to facilitate their secretion from the cell. It is thus envisioned that
the chimeric gene described above may be further supplemented by directing the coding
sequence to encode the instant polypeptides with appropriate intracellular targeting
sequences such as transit sequences (Keegstra (1989) Cell 56:247-253), signal sequences or
sequences encoding endoplasmic reticulum localization (Chrispeels (1991) Ann. Rev. Plant
Phys. Plant Mol. Biol. 42:21-53), or nuclear localization signals (Raikhel (1992) Plant
Phys. 100:1627-1632) with or without removing targeting sequences that are already present.
While the references cited give examples of each of these, the list is not exhaustive and more
targeting signals of use may be discovered in the future.

It may also be desirable to reduce or eliminate expression of genes encoding the
instant polypeptides in plants for some applications. In order to accomplish this, a chimeric
gene designed for co-suppression of the instant polypeptide can be constructed by linking a
gene or gene fragment encoding that polypeptide to plant promoter sequences.
Alternatively, a chimeric gene designed to express antisense RNA for all or part of the
instant nucleic acid fragment can be constructed by linking the gene or gene fragment in
reverse orientation to plant promoter sequences. Either the co-suppression or antisense
chimeric genes could be introduced into plants via transformation wherein expression of the
corresponding endogenous genes is reduced or eliminated.
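
By way of illustration only, placing a fragment in reverse orientation relative to a promoter means that the strand read downstream of the promoter is the reverse complement of the sense strand. The following minimal sketch shows that relationship; the sequence and the names `reverse_complement`, `sense_fragment` and `antisense_insert` are hypothetical and are not taken from the instant nucleic acid fragments.

```python
# Illustrative sketch only; the sequence below is made up for the example.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence made of A/C/G/T."""
    return seq.translate(_COMPLEMENT)[::-1]


sense_fragment = "ATGGCTTCAGGA"  # hypothetical coding fragment, written 5'->3'
antisense_insert = reverse_complement(sense_fragment)
print(antisense_insert)  # TCCTGAAGCCAT
```

Read 5' to 3' downstream of a promoter, `antisense_insert` would be transcribed into RNA complementary to the transcript of `sense_fragment`.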

Molecular genetic solutions to the generation of plants with altered gene expression
have a decided advantage over more traditional plant breeding approaches. Changes in plant
phenotypes can be produced by specifically inhibiting expression of one or more genes by
antisense inhibition or cosuppression (U.S. Patent Nos. 5,190,931, 5,107,065 and
5,283,323). An antisense or cosuppression construct would act as a dominant negative
regulator of gene activity. While conventional mutations can yield negative regulation of
gene activity, these effects are most likely recessive. The dominant negative regulation
available with a transgenic approach may be advantageous from a breeding perspective. In
addition, the ability to restrict the expression of a specific phenotype to the reproductive
tissues of the plant by the use of tissue specific promoters may confer agronomic
advantages relative to conventional mutations which may have an effect in all tissues in
which a mutant gene is ordinarily expressed.

The person skilled in the art will know that special considerations are associated with
the use of antisense or cosuppression technologies in order to reduce expression of particular
genes. For example, the proper level of expression of sense or antisense genes may require
the use of different chimeric genes utilizing different regulatory elements known to the
skilled artisan. Once transgenic plants are obtained by one of the methods described above,
it will be necessary to screen individual transgenics for those that most effectively display
the desired phenotype. Accordingly, the skilled artisan will develop methods for screening
large numbers of transformants. The nature of these screens will generally be chosen on
practical grounds, and is not an inherent part of the invention. For example, one can screen
by looking for changes in gene expression by using antibodies specific for the protein
encoded by the gene being suppressed, or one could establish assays that specifically
measure enzyme activity. A preferred method will be one which allows large numbers of
samples to be processed rapidly, since it will be expected that a large number of
transformants will be negative for the desired phenotype.

The instant polypeptides (or portions thereof) may be produced in heterologous host
cells, particularly in the cells of microbial hosts, and can be used to prepare antibodies to
these proteins by methods well known to those skilled in the art. The antibodies are useful
for detecting the polypeptides of the instant invention in situ in cells or in vitro in cell
extracts. Preferred heterologous host cells for production of the instant polypeptides are
microbial hosts. Microbial expression systems and expression vectors containing regulatory
sequences that direct high level expression of foreign proteins are well known to those
skilled in the art. Any of these could be used to construct a chimeric gene for production of
the instant polypeptides. This chimeric gene could then be introduced into appropriate
microorganisms via transformation to provide high level expression of the encoded proteins
involved in carbon catabolite repression. An example of a vector for high level expression
of the instant polypeptides in a bacterial host is provided (Example 6).

All or a substantial portion of the nucleic acid fragments of the instant invention may
also be used as probes for genetically and physically mapping the genes that they are a part
of, and as markers for traits linked to those genes. Such information may be useful in plant
breeding in order to develop lines with desired phenotypes. For example, the instant nucleic
acid fragments may be used as restriction fragment length polymorphism (RFLP) markers.
Southern blots (Maniatis) of restriction-digested plant genomic DNA may be probed with
the nucleic acid fragments of the instant invention. The resulting banding patterns may then
be subjected to genetic analyses using computer programs such as MapMaker (Lander et al.
(1987) Genomics 1:174-181) in order to construct a genetic map. In addition, the nucleic
acid fragments of the instant invention may be used to probe Southern blots containing
restriction endonuclease-treated genomic DNAs of a set of individuals representing parent
and progeny of a defined genetic cross. Segregation of the DNA polymorphisms is noted
and used to calculate the position of the instant nucleic acid sequence in the genetic map
previously obtained using this population (Botstein et al. (1980) Am. J. Hum. Genet.
32:314-331).
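
As a simplified illustration of how segregation data may be converted into a map position, the sketch below computes a two-point recombination fraction between a probe and a previously mapped marker, and converts it to a map distance with the Haldane mapping function. This is not the multipoint analysis performed by MapMaker; the genotype strings, the scoring symbols and the function names are hypothetical and assume a backcross-type population in which each progeny plant is scored as one character per marker.

```python
import math


def recombination_fraction(marker_a: str, marker_b: str) -> float:
    """Fraction of progeny whose genotype differs between two markers.

    Each string holds one character per progeny plant (for example 'A' or
    'H' in a backcross); '-' marks a missing score and is skipped.
    """
    if len(marker_a) != len(marker_b):
        raise ValueError("markers must be scored on the same progeny")
    scored = [(a, b) for a, b in zip(marker_a, marker_b)
              if a != "-" and b != "-"]
    if not scored:
        raise ValueError("no progeny scored for both markers")
    recombinants = sum(1 for a, b in scored if a != b)
    return recombinants / len(scored)


def haldane_cm(r: float) -> float:
    """Haldane map distance in centimorgans for a recombination fraction r."""
    if not 0 <= r < 0.5:
        raise ValueError("r must be in [0, 0.5)")
    return -50.0 * math.log(1.0 - 2.0 * r)


# Hypothetical scores for ten progeny plants.
probe_scores = "AHAHAAHHAH"   # banding pattern detected by the probe
mapped_marker = "AHAHAHHHAH"  # marker already placed on the genetic map
r = recombination_fraction(probe_scores, mapped_marker)
print(r, round(haldane_cm(r), 2))
```

In practice, far more progeny and several flanking markers would be needed to place a sequence with confidence.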

The production and use of plant gene-derived probes for use in genetic mapping is
described in Bernatzky and Tanksley (1986) Plant Mol. Biol. Reporter 4:37-41. Numerous
publications describe genetic mapping of specific cDNA clones using the methodology
outlined above or variations thereof. For example, F2 intercross populations, backcross
populations, randomly mated populations, near isogenic lines, and other sets of individuals
may be used for mapping. Such methodologies are well known to those skilled in the art.

Nucleic acid probes derived from the instant nucleic acid sequences may also be used
for physical mapping (i.e., placement of sequences on physical maps; see Hoheisel et al. In:
Nonmammalian Genomic Analysis: A Practical Guide, Academic Press 1996, pp. 319-346,
and references cited therein).

In another embodiment, nucleic acid probes derived from the instant nucleic acid
sequences may be used in direct fluorescence in situ hybridization (FISH) mapping (Trask
(1991) Trends Genet. 7:149-154). Although current methods of FISH mapping favor use of
large clones (several to several hundred kb; see Laan et al. (1995) Genome Res. 5:13-20),
improvements in sensitivity may allow performance of FISH mapping using shorter probes.

A variety of nucleic acid amplification-based methods of genetic and physical
mapping may be carried out using the instant nucleic acid sequences. Examples include
allele-specific amplification (Kazazian (1989) J. Lab. Clin. Med. 11:95-96), polymorphism
of PCR-amplified fragments (CAPS; Sheffield et al. (1993) Genomics 16:325-332), allele-specific
ligation (Landegren et al. (1988) Science 241:1077-1080), nucleotide extension
reactions (Sokolov (1990) Nucleic Acid Res. 18:3671), Radiation Hybrid Mapping (Walter
et al. (1997) Nat. Genet. 7:22-28) and Happy Mapping (Dear and Cook (1989) Nucleic Acid
Res. 17:6795-6807). For these methods, the sequence of a nucleic acid fragment is used to
design and produce primer pairs for use in the amplification reaction or in primer extension
reactions. The design of such primers is well known to those skilled in the art. In methods
employing PCR-based genetic mapping, it may be necessary to identify DNA sequence
differences between the parents of the mapping cross in the region corresponding to the
instant nucleic acid sequence. This, however, is generally not necessary for mapping
methods.

Loss of function mutant phenotypes may be identified for the instant cDNA clones
either by targeted gene disruption protocols or by identifying specific mutants for these
genes contained in a maize population carrying mutations in all possible genes (Ballinger
and Benzer (1989) Proc. Natl. Acad. Sci. USA 86:9402-9406; Koes et al. (1995) Proc. Natl.
Acad. Sci. USA 92:8149-8153; Bensen et al. (1995) Plant Cell 7:75-84). The latter approach
may be accomplished in two ways. First, short segments of the instant nucleic acid
fragments may be used in polymerase chain reaction protocols in conjunction with a
mutation tag sequence primer on DNAs prepared from a population of plants in which
Mutator transposons or some other mutation-causing DNA element has been introduced (see
Bensen, supra). The amplification of a specific DNA fragment with these primers indicates
the insertion of the mutation tag element in or near the plant gene encoding the instant
polypeptides. Alternatively, the instant nucleic acid fragment may be used as a
hybridization probe against PCR amplification products generated from the mutation
population using the mutation tag sequence primer in conjunction with an arbitrary genomic
site primer, such as that for a restriction enzyme site-anchored synthetic adaptor. With
either method, a plant containing a mutation in the endogenous gene encoding the instant
polypeptides can be identified and obtained. This mutant plant can then be used to
determine or confirm the natural function of the instant polypeptides disclosed herein.

EXAMPLES 

The present invention is further defined in the following Examples, in which all parts 
and percentages are by weight and degrees are Celsius, unless otherwise stated. It should be 
understood that these Examples, while indicating preferred embodiments of the invention, 
are given by way of illustration only. From the above discussion and these Examples, one 
skilled in the art can ascertain the essential characteristics of this invention, and without 
departing from the spirit and scope thereof, can make various changes and modifications of 
the invention to adapt it to various usages and conditions. 

EXAMPLE 1 

Composition of cDNA Libraries: Isolation and Sequencing of cDNA Clones 
cDNA libraries representing mRNAs from various corn, rice, soybean and wheat
tissues were prepared. The characteristics of the libraries are described below.



TABLE 2

cDNA Libraries from Corn, Rice, Soybean and Wheat

Library   Tissue                                                     Clone

cen3n     Corn Endosperm 20 Days After Pollination*                  cen3n.pk0044.b8
p0016     Corn Tassel Shoots, Pooled, 0.1-1.4 cm                     p0016.ctsbf56rb
p0118     Corn Stem Tissue Pooled From the 4-5 Internodes            p0118.chsbh89r
          Subtending The Tassel At Stages V8-V12**, Night
          harvested*
rda       Rice 15 day old leaf*                                      rda.pk0007.g3
rr1       Rice Root of Two Week Old Developing Seedling              rr1.pk0008.e12
                                                                     rr1.pk0047.g12
sdp4c     Soybean Developing Pods (10-12 mm)                         sdp4c.pk016.e10
sdr1f                                                                sdr1f.pk001.p7
sgs4c     Soybean Seeds 2 Days After Germination                     sgs4c.pk006.g6
                                                                     sgs4c.pk006.n21
srr1c     Soybean 8-Day-Old Root                                     srr1c.pk001.i24
wdk2c     Wheat Developing Kernel, 7 Days After Anthesis             wdk2c.pk018.c16
wlm1      Wheat Seedlings 1 Hour After Inoculation With Erysiphe     wlm1.pk0027.a12
          graminis f. sp tritici
wlm96     Wheat Seedlings 96 Hours After Inoculation With Erysiphe   wlm96.pk0007.e4
          graminis f. sp tritici

*These libraries were normalized essentially as described in U.S. Patent No. 5,482,845,
incorporated herein by reference.

**Corn developmental stages are explained in the publication "How a corn plant develops"
from the Iowa State University Coop. Ext. Service Special Report No. 48 reprinted
June 1993.

cDNA libraries may be prepared by any one of many methods available. For
example, the cDNAs may be introduced into plasmid vectors by first preparing the cDNA
libraries in Uni-ZAP™ XR vectors according to the manufacturer's protocol (Stratagene
Cloning Systems, La Jolla, CA). The Uni-ZAP™ XR libraries are converted into plasmid
libraries according to the protocol provided by Stratagene. Upon conversion, cDNA inserts
will be contained in the plasmid vector pBluescript. In addition, the cDNAs may be
introduced directly into precut Bluescript II SK(+) vectors (Stratagene) using T4 DNA
ligase (New England Biolabs), followed by transfection into DH10B cells according to the
manufacturer's protocol (GIBCO BRL Products). Once the cDNA inserts are in plasmid
vectors, plasmid DNAs are prepared from randomly picked bacterial colonies containing
recombinant pBluescript plasmids, or the insert cDNA sequences are amplified via
polymerase chain reaction using primers specific for vector sequences flanking the inserted
cDNA sequences. Amplified insert DNAs or plasmid DNAs are sequenced in dye-primer
sequencing reactions to generate partial cDNA sequences (expressed sequence tags or
"ESTs"; see Adams et al., (1991) Science 252:1651-1656). The resulting ESTs are analyzed
using a Perkin Elmer Model 377 fluorescent sequencer.

EXAMPLE 2 



Identification of cDNA Clones

cDNA clones encoding proteins involved in carbon catabolite repression were
identified by conducting BLAST (Basic Local Alignment Search Tool; Altschul et al. (1993)
J. Mol. Biol. 215:403-410; see also www.ncbi.nlm.nih.gov/BLAST/) searches for similarity
to sequences contained in the BLAST "nr" database (comprising all non-redundant
GenBank CDS translations, sequences derived from the 3-dimensional structure Brookhaven
Protein Data Bank, the last major release of the SWISS-PROT protein sequence database,
EMBL, and DDBJ databases). The cDNA sequences obtained in Example 1 were analyzed
for similarity to all publicly available DNA sequences contained in the "nr" database using
the BLASTN algorithm provided by the National Center for Biotechnology Information
(NCBI). The DNA sequences were translated in all reading frames and compared for
similarity to all publicly available protein sequences contained in the "nr" database using the
BLASTX algorithm (Gish and States (1993) Nat. Genet. 3:266-272) provided by the NCBI.
For convenience, the P-value (probability) of observing a match of a cDNA sequence to a
sequence contained in the searched databases merely by chance, as calculated by BLAST, is
reported herein as a "pLog" value, which represents the negative of the logarithm of the
reported P-value. Accordingly, the greater the pLog value, the greater the likelihood that the
cDNA sequence and the BLAST "hit" represent homologous proteins.
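
The conversion from a P-value to a pLog value can be sketched as follows. The text does not state the base of the logarithm; base 10 is assumed here, and the function name `plog` and the handling of a P-value of exactly zero are likewise assumptions made for illustration.

```python
import math


def plog(p_value: float) -> float:
    """Negative logarithm of a BLAST P-value (base 10 is an assumption)."""
    if not 0.0 <= p_value <= 1.0:
        raise ValueError("a P-value must lie between 0 and 1")
    if p_value == 0.0:
        # A P-value reported as zero has no finite logarithm; returning
        # infinity is a choice made for this sketch only.
        return math.inf
    return -math.log10(p_value)


print(round(plog(5e-59), 2))  # 58.3
```

Under this assumption, a smaller chance probability always gives a larger pLog value, so the two measures rank BLAST hits in the same order.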

EXAMPLE 3 

Characterization of cDNA Clones Encoding Proteins Involved in Carbon Catabolite Repression
The BLASTX search using the EST sequences from clones listed in Table 3 revealed
similarity of the polypeptides encoded by the cDNAs to SNF1 from Arabidopsis thaliana
(NCBI Identifier No. gi 5051782), Arabidopsis thaliana (NCBI Identifier No. gi 4895200),
Arabidopsis thaliana (NCBI Identifier No. gi 2980770), Arabidopsis thaliana (NCBI
Identifier No. gi 3885328), Cucumis sativus (NCBI Identifier No. gi 1743009), Glycine max
(NCBI Identifier No. gi 4567091), Oryza sativa (NCBI Identifier No. gi 4107001), and Oryza
sativa (NCBI Identifier No. gi 4107009). Shown in Table 3 are the BLAST results for
individual ESTs ("EST"), the sequences of the entire cDNA inserts comprising the indicated
cDNA clones ("FIS"), contigs assembled from two or more ESTs ("Contig"), contigs
assembled from an FIS and one or more ESTs ("Contig*"), or sequences encoding the entire
protein derived from an FIS, a contig, or an FIS and PCR ("CGS"):

TABLE 3

BLAST Results for Sequences Encoding Polypeptides Homologous to Arabidopsis thaliana,
Cucumis sativus, Glycine max, and Oryza sativa SNF1 Proteins

Clone                   Status    BLAST pLog Score

cen3n.pk0044.b8         FIS       >254.00 (gi 5051782)
p0016.ctsbf56rb         CGS       >254.00 (gi 4107001)
p0118.chsbh89r          CGS       >254.00 (gi 4107009)
Contig composed of:     Contig    58.30 (gi 4895200)
  rda.pk0007.g3
  rr1.pk0008.e12
  rr1.pk0047.g12
sdp4c.pk016.e10         CGS       >254.00 (gi 2980770)
sdr1f.pk001.p7          CGS       >254.00 (gi 1743009)
sgs4c.pk006.g6          CGS       >254.00 (gi 2980770)
sgs4c.pk006.n21         CGS       >254.00 (gi 4567091)
srr1c.pk001.i24         FIS       >254.00 (gi 3885328)
Contig composed of:     Contig    158.00 (gi 4107001)
  wdk2c.pk018.c16
  wlm1.pk0027.a12
wlm96.pk0007.e4         FIS       >254.00 (gi 4107009)





The data in Table 4 represents a calculation of the percent identity of the amino acid
sequences set forth in SEQ ID NOs:2, 4, 6, 8, 10, 12, 14, 16, 18, 20 and 22 and the
Arabidopsis thaliana, Cucumis sativus, Glycine max, and Oryza sativa sequences.

TABLE 4

Percent Identity of Amino Acid Sequences Deduced From the Nucleotide Sequences
of cDNA Clones Encoding Polypeptides Homologous to Arabidopsis thaliana, Cucumis
sativus, Glycine max, and Oryza sativa SNF1 Proteins

SEQ ID NO.    Percent Identity

2             73% (gi 5051782)
4             89% (gi 4107001)
6             88% (gi 4107009)
8             61% (gi 4895200)
10            74% (gi 2980770)
12            78% (gi 1743009)
14            72% (gi 2980770)
16            98% (gi 4567091)
18            82% (gi 3885328)
20            93% (gi 4107001)
22            81% (gi 4107009)



Sequence alignments and percent identity calculations were performed using the
Megalign program of the LASERGENE bioinformatics computing suite (DNASTAR Inc.,
Madison, WI). Multiple alignment of the sequences was performed using the Clustal
method of alignment (Higgins and Sharp (1989) CABIOS. 5:151-153) with the default
parameters (GAP PENALTY=10, GAP LENGTH PENALTY=10). Default parameters for
pairwise alignments using the Clustal method were KTUPLE=1, GAP PENALTY=3,
WINDOW=5 and DIAGONALS SAVED=5. Sequence alignments and BLAST scores and
probabilities indicate that the nucleic acid fragments comprising the instant cDNA clones
encode a substantial portion of a SNF1 protein. These sequences represent the first corn,
soybean and wheat sequences and a new rice sequence encoding SNF1 proteins.
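
For orientation, a percent identity figure can be computed from a pair of already aligned sequences as sketched below. This is not the Megalign implementation, whose exact treatment of gaps and sequence ends is not described here; the sketch assumes that columns containing a gap in either sequence are excluded from the comparison, and the function name and the example peptides are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of gap-free aligned columns carrying the same residue.

    Both inputs must be rows of the same alignment (equal length).
    Columns with a gap in either row are skipped (an assumption).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == gap or y == gap:
            continue
        compared += 1
        if x.upper() == y.upper():
            identical += 1
    if compared == 0:
        raise ValueError("no gap-free columns to compare")
    return 100.0 * identical / compared


# Hypothetical aligned peptides: 4 identical residues in 6 gap-free columns.
print(round(percent_identity("MKV-LTA", "MRVGLSA"), 1))  # 66.7
```

Other conventions, such as dividing by the length of the shorter sequence, give different values, so a figure is only comparable with others computed the same way.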
EXAMPLE 4
Expression of Chimeric Genes in Monocot Cells
A chimeric gene comprising a cDNA encoding the instant polypeptides in sense
orientation with respect to the maize 27 kD zein promoter that is located 5' to the cDNA
fragment, and the 10 kD zein 3' end that is located 3' to the cDNA fragment, can be
constructed. The cDNA fragment of this gene may be generated by polymerase chain
reaction (PCR) of the cDNA clone using appropriate oligonucleotide primers. Cloning sites
(NcoI or SmaI) can be incorporated into the oligonucleotides to provide proper orientation
of the DNA fragment when inserted into the digested vector pML103 as described below.
Amplification is then performed in a standard PCR. The amplified DNA is then digested
with restriction enzymes NcoI and SmaI and fractionated on an agarose gel. The appropriate
band can be isolated from the gel and combined with a 4.9 kb NcoI-SmaI fragment of the
plasmid pML103. Plasmid pML103 has been deposited under the terms of the Budapest
Treaty at ATCC (American Type Culture Collection, 10801 University Blvd., Manassas,
VA 20110-2209), and bears accession number ATCC 97366. The DNA segment from
pML103 contains a 1.05 kb SalI-NcoI promoter fragment of the maize 27 kD zein gene and
a 0.96 kb SmaI-SalI fragment from the 3' end of the maize 10 kD zein gene in the vector
pGem9Zf(+) (Promega). Vector and insert DNA can be ligated at 15°C overnight,
essentially as described (Maniatis). The ligated DNA may then be used to transform E. coli
XL1-Blue (Epicurian Coli XL-1 Blue™; Stratagene). Bacterial transformants can be
screened by restriction enzyme digestion of plasmid DNA and limited nucleotide sequence
analysis using the dideoxy chain termination method (Sequenase™ DNA Sequencing Kit;
U.S. Biochemical). The resulting plasmid construct would comprise a chimeric gene
encoding, in the 5' to 3' direction, the maize 27 kD zein promoter, a cDNA fragment
encoding the instant polypeptides, and the 10 kD zein 3' region.

The chimeric gene described above can then be introduced into corn cells by the
following procedure. Immature corn embryos can be dissected from developing caryopses
derived from crosses of the inbred corn lines H99 and LH132. The embryos are isolated 10
to 11 days after pollination when they are 1.0 to 1.5 mm long. The embryos are then placed
with the axis-side facing down and in contact with agarose-solidified N6 medium (Chu et al.
(1975) Sci. Sin. Peking 18:659-668). The embryos are kept in the dark at 27°C. Friable
embryogenic callus consisting of undifferentiated masses of cells with somatic
proembryoids and embryoids borne on suspensor structures proliferates from the scutellum
of these immature embryos. The embryogenic callus isolated from the primary explant can
be cultured on N6 medium and sub-cultured on this medium every 2 to 3 weeks.

The plasmid, p35S/Ac (obtained from Dr. Peter Eckes, Hoechst Ag, Frankfurt,
Germany) may be used in transformation experiments in order to provide for a selectable
marker. This plasmid contains the Pat gene (see European Patent Publication 0 242 236)
which encodes phosphinothricin acetyl transferase (PAT). The enzyme PAT confers
resistance to herbicidal glutamine synthetase inhibitors such as phosphinothricin. The pat
gene in p35S/Ac is under the control of the 35S promoter from Cauliflower Mosaic Virus
(Odell et al. (1985) Nature 313:810-812) and the 3' region of the nopaline synthase gene
from the T-DNA of the Ti plasmid of Agrobacterium tumefaciens.

The particle bombardment method (Klein et al. (1987) Nature 327:70-73) may be used
to transfer genes to the callus culture cells. According to this method, gold particles (1 μm
in diameter) are coated with DNA using the following technique. Ten μg of plasmid DNAs
are added to 50 μL of a suspension of gold particles (60 mg per mL). Calcium chloride
(50 μL of a 2.5 M solution) and spermidine free base (20 μL of a 1.0 M solution) are added
to the particles. The suspension is vortexed during the addition of these solutions. After
10 minutes, the tubes are briefly centrifuged (5 sec at 15,000 rpm) and the supernatant
removed. The particles are resuspended in 200 μL of absolute ethanol, centrifuged again
and the supernatant removed. The ethanol rinse is performed again and the particles
resuspended in a final volume of 30 μL of ethanol. An aliquot (5 μL) of the DNA-coated
gold particles can be placed in the center of a Kapton™ flying disc (Bio-Rad Labs). The
particles are then accelerated into the corn tissue with a Biolistic™ PDS-1000/He (Bio-Rad
Instruments, Hercules CA), using a helium pressure of 1000 psi, a gap distance of 0.5 cm
and a flying distance of 1.0 cm.
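
The quantities in the coating procedure above imply approximate amounts of gold and DNA per bombardment, which can be worked out as sketched below. The calculation assumes that no particles are lost during the ethanol rinses and that all of the DNA remains bound to the gold; neither assumption is stated in the procedure, so the results are upper estimates only. All variable and function names are illustrative.

```python
# Values taken from the coating procedure described above.
GOLD_STOCK_MG_PER_ML = 60.0  # gold particle suspension
GOLD_VOLUME_UL = 50.0        # volume of suspension used
DNA_UG = 10.0                # plasmid DNA added
FINAL_VOLUME_UL = 30.0       # final ethanol volume
ALIQUOT_UL = 5.0             # volume loaded per flying disc


def per_shot(total: float, final_volume_ul: float, aliquot_ul: float) -> float:
    """Amount delivered in one aliquot, assuming an even suspension."""
    return total * aliquot_ul / final_volume_ul


gold_mg = GOLD_STOCK_MG_PER_ML * GOLD_VOLUME_UL / 1000.0  # total gold, mg
gold_per_shot_mg = per_shot(gold_mg, FINAL_VOLUME_UL, ALIQUOT_UL)
# Upper estimate: assumes every microgram of DNA stays on the particles.
dna_per_shot_ug = per_shot(DNA_UG, FINAL_VOLUME_UL, ALIQUOT_UL)
shots_available = int(FINAL_VOLUME_UL // ALIQUOT_UL)

print(gold_mg, gold_per_shot_mg, round(dna_per_shot_ug, 2), shots_available)
```

Under these assumptions one preparation holds 3 mg of gold and supplies six 5 μL aliquots of about 0.5 mg gold each.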

For bombardment, the embryogenic tissue is placed on filter paper over agarose-solidified
N6 medium. The tissue is arranged as a thin lawn and covers a circular area of
about 5 cm in diameter. The petri dish containing the tissue can be placed in the chamber of
the PDS-1000/He approximately 8 cm from the stopping screen. The air in the chamber is
then evacuated to a vacuum of 28 inches of Hg. The macrocarrier is accelerated with a
helium shock wave using a rupture membrane that bursts when the He pressure in the shock
tube reaches 1000 psi.

Seven days after bombardment the tissue can be transferred to N6 medium that
contains glufosinate (2 mg per liter) and lacks casein or proline. The tissue continues to
grow slowly on this medium. After an additional 2 weeks the tissue can be transferred to
fresh N6 medium containing glufosinate. After 6 weeks, areas of about 1 cm in diameter
of actively growing callus can be identified on some of the plates containing the
glufosinate-supplemented medium. These calli may continue to grow when sub-cultured on the
selective medium.

Plants can be regenerated from the transgenic callus by first transferring clusters of
tissue to N6 medium supplemented with 0.2 mg per liter of 2,4-D. After two weeks the
tissue can be transferred to regeneration medium (Fromm et al. (1990) Bio/Technology
8:833-839).

EXAMPLE 5
Expression of Chimeric Genes in Dicot Cells
A seed-specific expression cassette composed of the promoter and transcription
terminator from the gene encoding the β subunit of the seed storage protein phaseolin from
the bean Phaseolus vulgaris (Doyle et al. (1986) J. Biol. Chem. 261:9228-9238) can be used
for expression of the instant polypeptides in transformed soybean. The phaseolin cassette
includes about 500 nucleotides upstream (5') from the translation initiation codon and about
1650 nucleotides downstream (3') from the translation stop codon of phaseolin. Between the
5' and 3' regions are the unique restriction endonuclease sites Nco I (which includes the ATG
translation initiation codon), Sma I, Kpn I and Xba I. The entire cassette is flanked by
Hind III sites.

The cDNA fragment of this gene may be generated by polymerase chain reaction
(PCR) of the cDNA clone using appropriate oligonucleotide primers. Cloning sites can be
incorporated into the oligonucleotides to provide proper orientation of the DNA fragment
when inserted into the expression vector. Amplification is then performed as described
above, and the isolated fragment is inserted into a pUC18 vector carrying the seed
expression cassette.

Soybean embryos may then be transformed with the expression vector comprising
sequences encoding the instant polypeptides. To induce somatic embryos, cotyledons,
3-5 mm in length dissected from surface sterilized, immature seeds of the soybean cultivar
A2872, can be cultured in the light or dark at 26°C on an appropriate agar medium for
6-10 weeks. Somatic embryos which produce secondary embryos are then excised and
placed into a suitable liquid medium. After repeated selection for clusters of somatic
embryos which multiply as early, globular staged embryos, the suspensions are maintained
as described below.

Soybean embryogenic suspension cultures can be maintained in 35 mL liquid media on a
rotary shaker, 150 rpm, at 26°C with fluorescent lights on a 16:8 hour day/night schedule.
Cultures are subcultured every two weeks by inoculating approximately 35 mg of tissue into
35 mL of liquid medium.

Soybean embryogenic suspension cultures may then be transformed by the method of
particle gun bombardment (Klein et al. (1987) Nature (London) 327:70-73, U.S. Patent
No. 4,945,050). A DuPont Biolistic™ PDS1000/HE instrument (helium retrofit) can be used
for these transformations.

A selectable marker gene which can be used to facilitate soybean transformation is a
chimeric gene composed of the 35S promoter from Cauliflower Mosaic Virus (Odell et al.
(1985) Nature 313:810-812), the hygromycin phosphotransferase gene from plasmid pJR225
(from E. coli; Gritz et al. (1983) Gene 25:179-188) and the 3' region of the nopaline synthase
gene from the T-DNA of the Ti plasmid of Agrobacterium tumefaciens. The seed expression
cassette comprising the phaseolin 5' region, the fragment encoding the instant polypeptides
and the phaseolin 3' region can be isolated as a restriction fragment. This fragment can then
be inserted into a unique restriction site of the vector carrying the marker gene.

To 50 μL of a 60 mg/mL 1 μm gold particle suspension is added (in order): 5 μL
DNA (1 μg/μL), 20 μL spermidine (0.1 M), and 50 μL CaCl2 (2.5 M). The particle
preparation is then agitated for three minutes, spun in a microfuge for 10 seconds and the
supernatant removed. The DNA-coated particles are then washed once in 400 μL 70%
ethanol and resuspended in 40 μL of anhydrous ethanol. The DNA/particle suspension can
be sonicated three times for one second each. Five μL of the DNA-coated gold particles are
then loaded on each macro carrier disk.

Approximately 300-400 mg of a two-week-old suspension culture is placed in an
empty 60x15 mm petri dish and the residual liquid removed from the tissue with a pipette.
For each transformation experiment, approximately 5-10 plates of tissue are normally
bombarded. Membrane rupture pressure is set at 1100 psi and the chamber is evacuated to a
vacuum of 28 inches mercury. The tissue is placed approximately 3.5 inches away from the
retaining screen and bombarded three times. Following bombardment, the tissue can be
divided in half and placed back into liquid and cultured as described above.

Five to seven days post bombardment, the liquid media may be exchanged with fresh
media, and eleven to twelve days post bombardment with fresh media containing 50 mg/mL
hygromycin. This selective media can be refreshed weekly. Seven to eight weeks post
bombardment, green, transformed tissue may be observed growing from untransformed,
necrotic embryogenic clusters. Isolated green tissue is removed and inoculated into
individual flasks to generate new, clonally propagated, transformed embryogenic suspension
cultures. Each new line may be treated as an independent transformation event. These
suspensions can then be subcultured and maintained as clusters of immature embryos or
regenerated into whole plants by maturation and germination of individual somatic embryos.

EXAMPLE 6 
Expression of Chimeric Genes in Microbial Cells 
The cDNAs encoding the instant polypeptides can be inserted into the T7 E. coli
expression vector pBT430. This vector is a derivative of pET-3a (Rosenberg et al. (1987)
Gene 56:125-135) which employs the bacteriophage T7 RNA polymerase/T7 promoter
system. Plasmid pBT430 was constructed by first destroying the EcoR I and Hind III sites in
pET-3a at their original positions. An oligonucleotide adaptor containing EcoR I and
Hind III sites was inserted at the BamH I site of pET-3a. This created pET-3aM with
additional unique cloning sites for insertion of genes into the expression vector. Then, the
Nde I site at the position of translation initiation was converted to an Nco I site using
oligonucleotide-directed mutagenesis. The DNA sequence of pET-3aM in this region,
5'-CATATGG, was converted to 5'-CCCATGG in pBT430.
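The site conversion described above can be checked with a few lines of Python. This is only an illustrative sketch: the recognition sequences used (CATATG for Nde I, CCATGG for Nco I) are the commonly cited ones, and the helper name `sites_present` is hypothetical, not part of the disclosure.

```python
# Commonly cited recognition sequences for the two enzymes named in the text.
SITES = {"NdeI": "CATATG", "NcoI": "CCATGG"}

def sites_present(seq: str) -> dict:
    """Report which recognition sites occur in seq (case-insensitive)."""
    seq = seq.upper()
    return {name: site in seq for name, site in SITES.items()}

pet3am_region = "CATATGG"  # pET-3aM sequence as quoted in the text
pbt430_region = "CCCATGG"  # pBT430 sequence as quoted in the text

print("pET-3aM:", sites_present(pet3am_region))
print("pBT430: ", sites_present(pbt430_region))
# Both versions retain the ATG at the position of translation initiation.
print("ATG kept:", "ATG" in pet3am_region and "ATG" in pbt430_region)
```

Under these assumptions, the quoted pET-3aM sequence contains only the Nde I site and the quoted pBT430 sequence contains only the Nco I site.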
Plasmid DNA containing a cDNA may be appropriately digested to release a nucleic
acid fragment encoding the protein. This fragment may then be purified on a 1% NuSieve
GTG™ low melting agarose gel (FMC). Buffer and agarose contain 10 µg/mL ethidium
bromide for visualization of the DNA fragment. The fragment can then be purified from the
agarose gel by digestion with GELase™ (Epicentre Technologies) according to the
manufacturer's instructions, ethanol precipitated, dried and resuspended in 20 µL of water.
Appropriate oligonucleotide adapters may be ligated to the fragment using T4 DNA ligase
(New England Biolabs, Beverly, MA). The fragment containing the ligated adapters can be
purified from the excess adapters using low melting agarose as described above. The vector
pBT430 is digested, dephosphorylated with alkaline phosphatase (NEB) and deproteinized
with phenol/chloroform as described above. The prepared vector pBT430 and fragment can
then be ligated at 16°C for 15 hours followed by transformation into DH5 electrocompetent
cells (GIBCO BRL). Transformants can be selected on agar plates containing LB media and
100 µg/mL ampicillin. Transformants containing the gene encoding the instant polypeptides
are then screened for the correct orientation with respect to the T7 promoter by restriction
enzyme analysis.

For high level expression, a plasmid clone with the cDNA insert in the correct
orientation relative to the T7 promoter can be transformed into E. coli strain BL21(DE3)
(Studier et al. (1986) J. Mol. Biol. 189:113-130). Cultures are grown in LB medium
containing ampicillin (100 mg/L) at 25°C. At an optical density at 600 nm of approximately
1, IPTG (isopropylthio-β-galactoside, the inducer) can be added to a final concentration of
0.4 mM and incubation can be continued for 3 h at 25°C. Cells are then harvested by
centrifugation and re-suspended in 50 µL of 50 mM Tris-HCl at pH 8.0 containing 0.1 mM
DTT and 0.2 mM phenylmethylsulfonyl fluoride. A small amount of 1 mm glass beads can
be added and the mixture sonicated 3 times for about 5 seconds each time with a microprobe
sonicator. The mixture is centrifuged and the protein concentration of the supernatant
determined. One µg of protein from the soluble fraction of the culture can be separated by
SDS-polyacrylamide gel electrophoresis. Gels can be observed for protein bands migrating
at the expected molecular weight.
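As a rough aid for judging where a band might be expected, the molecular weight of a polypeptide can be approximated from its length. The sketch below assumes an average residue mass of about 110 Da, a common rule of thumb that is not taken from this disclosure. The function name is hypothetical. The 509-residue length is that given for SEQ ID NO:4 in the sequence listing.

```python
AVG_RESIDUE_MASS_DA = 110.0  # rule-of-thumb average residue mass (assumption)

def approx_mass_kda(n_residues: int) -> float:
    """Approximate polypeptide mass in kDa from its residue count."""
    if n_residues <= 0:
        raise ValueError("residue count must be positive")
    return n_residues * AVG_RESIDUE_MASS_DA / 1000.0

# Example: a 509-residue polypeptide (length listed for SEQ ID NO:4).
print(f"{approx_mass_kda(509):.1f} kDa (rough estimate)")
```

An estimate of this kind ignores the actual amino acid composition and any vector-encoded residues, so it only indicates an approximate gel region.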

Various modifications of the invention in addition to those shown and described herein 
will be apparent to those skilled in the art from the foregoing description. Such 
modifications are also intended to fall within the scope of the appended claims. 

The disclosure of each reference set forth above is incorporated herein by reference in 
its entirety. 
CLAIMS

What is claimed is: 

1. An isolated polynucleotide comprising a first nucleotide sequence encoding a
polypeptide of at least 240 amino acids that has at least 90% identity based on the Clustal
method of alignment when compared to a polypeptide selected from the group consisting of
SEQ ID NOs:2, 4, 6, 8, 10, 12, 14, 16, 18, 20 and 22, or a second nucleotide sequence
comprising the complement of the first nucleotide sequence.
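For illustration only, percent identity between two sequences that have already been aligned can be computed as in the sketch below. It does not reproduce the Clustal method of alignment or its scoring. It assumes the alignment is supplied as two equal-length strings with "-" marking gaps, and it counts identity over ungapped columns, which is one of several possible conventions. The function name is hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over columns where neither aligned sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" or y == "-":
            continue  # skip gapped columns (a convention chosen for this sketch)
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared

# Toy one-letter-code example, not taken from the sequence listing.
print(percent_identity("MKVLA", "MKILA"))
```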

2. The isolated polynucleotide of Claim 1, wherein the first nucleotide sequence
consists of a nucleic acid sequence selected from the group consisting of SEQ ID NOs:1, 3,
5, 7, 9, 11, 13, 15, 17, 19 and 21 that codes for the polypeptide selected from the group
consisting of SEQ ID NOs:2, 4, 6, 8, 10, 12, 14, 16, 18, 20 and 22.

3. The isolated polynucleotide of Claim 1 wherein the nucleotide sequences are DNA.

4. The isolated polynucleotide of Claim 1 wherein the nucleotide sequences are RNA.

5. A chimeric gene comprising the isolated polynucleotide of Claim 1 operably 
linked to suitable regulatory sequences. 

6. An isolated host cell comprising the chimeric gene of Claim 5. 

7. A host cell comprising an isolated polynucleotide of Claim 1.

8. The host cell of Claim 7 wherein the host cell is selected from the group
consisting of yeast, bacteria, plant, and virus.

9. A virus comprising the isolated polynucleotide of Claim 1.

10. A polypeptide of at least 240 amino acids that has at least 90% identity based on
the Clustal method of alignment when compared to a polypeptide selected from the group
consisting of SEQ ID NOs:2, 4, 6, 8, 10, 12, 14, 16, 18, 20 and 22.

11. A method of selecting an isolated polynucleotide that affects the level of 
expression of a carbon catabolite repression polypeptide in a plant cell, the method 
comprising the steps of: 

(a) constructing an isolated polynucleotide comprising a nucleotide sequence of
at least one of 30 contiguous nucleotides derived from an isolated polynucleotide of
Claim 1;

(b) introducing the isolated polynucleotide into a plant cell; 

(c) measuring the level of a polypeptide in the plant cell containing the 
polynucleotide to provide a positive selection means. 

12. The method of Claim 11 wherein the isolated polynucleotide consists of a
nucleotide sequence selected from the group consisting of SEQ ID NOs:1, 3, 5, 7, 9, 11, 13,
15, 17, 19 and 21 that codes for the polypeptide selected from the group consisting of SEQ
ID NOs:2, 4, 6, 8, 10, 12, 14, 16, 18, 20 and 22.

13. A method of selecting an isolated polynucleotide that affects the level of 
expression of a carbon catabolite repression polypeptide in a plant cell, the method 
comprising the steps of: 

(a) constructing an isolated polynucleotide of Claim 1;

(b) introducing the isolated polynucleotide into a plant cell; and 

(c) measuring the level of polypeptide in the plant cell containing the 
polynucleotide to provide a positive selection means. 

14. A method of obtaining a nucleic acid fragment encoding a carbon catabolite 
repression polypeptide comprising the steps of: 

(a) synthesizing an oligonucleotide primer comprising a nucleotide sequence of 
at least one of 30 contiguous nucleotides derived from a sequence selected from the group 
consisting of SEQ ID NOs:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, and the complement of such
nucleotide sequences; and 

(b) amplifying a nucleic acid sequence using the oligonucleotide primer. 
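As a minimal sketch of what "30 contiguous nucleotides" and "the complement" of a listed sequence can look like in practice, the following enumerates every 30-nucleotide window of a sequence and forms a reverse complement. The helper names are hypothetical. The example input is the first 60 nucleotides of SEQ ID NO:1 as printed in the sequence listing. Primer design criteria (melting temperature, specificity and so on) are not addressed.

```python
# Translation table for complementing unambiguous DNA bases.
COMPLEMENT = str.maketrans("acgtACGT", "tgcaTGCA")

def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]

def contiguous_windows(seq: str, k: int = 30) -> list:
    """Return every stretch of k contiguous nucleotides in seq."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]

# First 60 nucleotides of SEQ ID NO:1, with the listing's spacing removed.
seq1_start = "gcacgaggagcagctcccctgcccctcgcagcggctactctacaggtctagcgactcttt"

windows = contiguous_windows(seq1_start, 30)
print(len(windows), "windows of 30 nucleotides")
print("first window:      ", windows[0])
print("reverse complement:", reverse_complement(windows[0]))
```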

15. A method of obtaining a nucleic acid fragment encoding a carbon catabolite 
repression polypeptide comprising the steps of: 

(a) probing a cDNA or genomic library with an isolated polynucleotide
comprising at least one of 30 contiguous nucleotides derived from a nucleotide sequence
selected from the group consisting of SEQ ID NOs:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, and
the complement of such nucleotide sequences;

(b) identifying a DNA clone that hybridizes with the isolated polynucleotide;

(c) isolating the identified DNA clone; and

(d) sequencing the cDNA or genomic fragment that comprises the isolated DNA
clone.

16. A composition comprising the isolated polynucleotide of Claim 1.

17. An isolated polynucleotide comprising the nucleotide sequence having at least 
one of 30 contiguous nucleotides derived from a nucleic acid sequence selected from the 
group consisting of SEQ ID NOs:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, and the complement of
such sequences. 

18. An expression cassette comprising an isolated polynucleotide of Claim 1 
operably linked to a promoter. 

19. A method for positive selection of a transformed cell comprising: 

(a) transforming a host cell with the chimeric gene of Claim 5 or an expression
cassette of Claim 18; and 

(b) growing the transformed host cell under conditions which allow expression 
of the polynucleotide in an amount sufficient to alter expression of glucose repressible genes 
to provide a positive selection means. 

20. The method of Claim 19 wherein the host is a plant cell. 
21. The method of Claim 19 wherein the plant cell is a dicot or a monocot.

22. An isolated polynucleotide comprising a first nucleotide sequence encoding a
polypeptide of at least 77 amino acids that has at least 85% identity based on the Clustal
method of alignment when compared to a polypeptide selected from the group consisting of
SEQ ID NOs:24, 26 and 28, or a second nucleotide sequence comprising the complement of
the first nucleotide sequence.

23. A polypeptide of at least 77 amino acids that has at least 85% identity based on
the Clustal method of alignment when compared to a polypeptide selected from the group
consisting of SEQ ID NOs:24, 26 and 28, or a second nucleotide sequence comprising the
complement of the first nucleotide sequence.
SEQUENCE LISTING

<110> E. I. du Pont de Nemours and Company
Pioneer Hi-Bred International, Inc.

<120> Plant Carbon Catabolite Repression Proteins 

<130> BB1315 

<140> 
<141> 

<150> 60/112,563 
<151> 1998-12-16 

<160> 28 

<170> Microsoft Office 97
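The bracketed numeric identifiers used throughout this listing can be read mechanically. The sketch below is an illustration only: the label text reflects the usual reading of such identifiers under WIPO Standard ST.25 and should be checked against that standard, and the names `FIELD`, `LABELS` and `parse_fields` are hypothetical.

```python
import re

# Matches lines such as "<211> 1707"; the value may be empty (e.g. "<140>").
FIELD = re.compile(r"^<(\d{3})>\s*(.*?)\s*$")

# Usual meanings of some identifiers (assumed from WIPO Standard ST.25).
LABELS = {
    "110": "applicant", "120": "title", "130": "file reference",
    "150": "prior application number", "151": "prior application filing date",
    "160": "number of SEQ ID NOs", "170": "software",
    "210": "SEQ ID NO", "211": "length", "212": "type", "213": "organism",
}

def parse_fields(text: str) -> list:
    """Return (identifier, value) pairs for every identifier line in text."""
    fields = []
    for line in text.splitlines():
        match = FIELD.match(line.strip())
        if match:
            fields.append((match.group(1), match.group(2)))
    return fields

sample = "<160> 28\n<210> 1\n<211> 1707\n<212> DNA\n<213> Zea mays"
for code, value in parse_fields(sample):
    print(code, LABELS.get(code, "?"), "=", value)
```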

<210> 1 

<211> 1707 

<212> DNA 

<213> Zea mays 

<400> 1 

gcacgaggag cagctcccct gcccctcgca gcggctactc tacaggtcta gcgactcttt 60 

cgccatccat agagggagga ggcgcggcgg agatggtggg cggtggcggc ggcgggccgc 120 

tgcggcgggt gggcaagtac gaggtgggac gcaccatcgg ggaaggcacc ttcgccaagg 180 

tcaagttcgc gcagaacacc gagaccgggg agagcgtcgc catgaaggtg ctcgaccgct 240 

cctccatcct caagaacaag atggccgaac agattaagag agaaatatcc ataatgaagc 300 

ttgtcaggca tcccaatgtc gttaggctac acgaggtttt ggcaagccgg aagaagatat 360 

ttataattct ggagttcatc actggcggcg agctattcga taaaattatt cgtcatggga 420 

gactcagtga agcagatgcc cgcagatact ttcagcagct tattgatggt gttgattttt 480 

gtcacaagaa aggagtctac catcgagact taaagcctga aaatctcctc cttgattccc 540 

aaggcaatct taaaatttca gactttggac tgagtgcatg gcctgctcag ggatctttcc 600 

ttctacgtac aacctgtggg actccgaact atgttgcccc agaggttctc agtcataagg 660 

gatataatgg cgcacttgct gatacatggt catgtggagt aattttatat gtattgttag 720 

caggttatct tccatttgat gaagtggacc tgactaccct ttatggaaag attgagagtg 780 

cagaatattc attcccagcc tggttttcag gtggtgcaaa atcactgatt cgtagaattc 840 

ttgacccaaa tccagagaca cgaatcagga tagaagagat cagaagcgat gaatggtttc 900 

agaaaaatta tgaacctatc aaagaaatag aaaatgagga agtcaatctc gatgatgtca 960 

atgcagcttt tgatgatcct gaggacgata atgaggatgc tttcgaagat gaaacaggtc 1020 

ctctaacact taatgcattc gacctaatca ttctttcgca aggattgaat cttgcagcgc 1080 

tctttgaccg cagacaggac tgtgacaagc ttcaaaatag attcttatca cgcaatccag 1140 

caaaggttat cttgtcaagc atggaggttg ttgcgcaatc aatgggattt aagacacaca 1200 

ttcgcaatta taagatgagg gtggaaggtc taaatgccga caagactagt catctctcag 1260 

ttatggttga agttttcgaa gttgctccat caatcttcat ggtagaacta caaagagcag 1320 

caggagatac ttcagagtat aacacgttcg taaataacta ctgcggcaaa ttagatgata 1380 

tcatctggaa atttccaact gagaagggaa aatcaaggat accccggtta tcaaagtctc 1440 

attcataatc tagtgctgtg tgttatcaat tactatggac ttaagtggat atatttgcag 1500 

agtgtatcgc aattggatta tgatattcta agtgtagctc ttcgagtagt atatgtttgt 1560 

atattcagag tccccatgta caaagaggcc gttttaattg aggtttccgt acctcgattc 1620 

cctgtgcaaa gttttcatcg atgttttaga ctgcacatct gtatattttg tcgagagcat 1680 

cagtttctta aaaaaaaaaa aaaaaaa 1707 

<210> 2 

<211> 481 

<212> PRT 

<213> Zea mays 
<400> 2

Thr Arg Ser Ser Ser Pro Ala Pro Arg Ser Gly Tyr Ser Thr Gly Leu
1 5 10 15

Ala Thr Leu Ser Pro Ser Ile Glu Gly Gly Gly Ala Ala Glu Met Val
20 25 30

Gly Gly Gly Gly Gly Gly Pro Leu Arg Arg Val Gly Lys Tyr Glu Val
35 40 45

Gly Arg Thr Ile Gly Glu Gly Thr Phe Ala Lys Val Lys Phe Ala Gln
50 55 60

Asn Thr Glu Thr Gly Glu Ser Val Ala Met Lys Val Leu Asp Arg Ser
65 70 75 80

Ser Ile Leu Lys Asn Lys Met Ala Glu Gln Ile Lys Arg Glu Ile Ser
85 90 95

Ile Met Lys Leu Val Arg His Pro Asn Val Val Arg Leu His Glu Val
100 105 110

Leu Ala Ser Arg Lys Lys Ile Phe Ile Ile Leu Glu Phe Ile Thr Gly
115 120 125

Gly Glu Leu Phe Asp Lys Ile Ile Arg His Gly Arg Leu Ser Glu Ala
130 135 140

Asp Ala Arg Arg Tyr Phe Gln Gln Leu Ile Asp Gly Val Asp Phe Cys
145 150 155 160

His Lys Lys Gly Val Tyr His Arg Asp Leu Lys Pro Glu Asn Leu Leu
165 170 175

Leu Asp Ser Gln Gly Asn Leu Lys Ile Ser Asp Phe Gly Leu Ser Ala
180 185 190

Trp Pro Ala Gln Gly Ser Phe Leu Leu Arg Thr Thr Cys Gly Thr Pro
195 200 205

Asn Tyr Val Ala Pro Glu Val Leu Ser His Lys Gly Tyr Asn Gly Ala
210 215 220

Leu Ala Asp Thr Trp Ser Cys Gly Val Ile Leu Tyr Val Leu Leu Ala
225 230 235 240

Gly Tyr Leu Pro Phe Asp Glu Val Asp Leu Thr Thr Leu Tyr Gly Lys
245 250 255

Ile Glu Ser Ala Glu Tyr Ser Phe Pro Ala Trp Phe Ser Gly Gly Ala
260 265 270

Lys Ser Leu Ile Arg Arg Ile Leu Asp Pro Asn Pro Glu Thr Arg Ile
275 280 285

Arg Ile Glu Glu Ile Arg Ser Asp Glu Trp Phe Gln Lys Asn Tyr Glu
290 295 300

Pro Ile Lys Glu Ile Glu Asn Glu Glu Val Asn Leu Asp Asp Val Asn
305 310 315 320

Ala Ala Phe Asp Asp Pro Glu Asp Asp Asn Glu Asp Ala Phe Glu Asp
325 330 335

Glu Thr Gly Pro Leu Thr Leu Asn Ala Phe Asp Leu Ile Ile Leu Ser
340 345 350

Gln Gly Leu Asn Leu Ala Ala Leu Phe Asp Arg Arg Gln Asp Cys Asp
355 360 365

Lys Leu Gln Asn Arg Phe Leu Ser Arg Asn Pro Ala Lys Val Ile Leu
370 375 380

Ser Ser Met Glu Val Val Ala Gln Ser Met Gly Phe Lys Thr His Ile
385 390 395 400

Arg Asn Tyr Lys Met Arg Val Glu Gly Leu Asn Ala Asp Lys Thr Ser
405 410 415

His Leu Ser Val Met Val Glu Val Phe Glu Val Ala Pro Ser Ile Phe
420 425 430

Met Val Glu Leu Gln Arg Ala Ala Gly Asp Thr Ser Glu Tyr Asn Thr
435 440 445

Phe Val Asn Asn Tyr Cys Gly Lys Leu Asp Asp Ile Ile Trp Lys Phe
450 455 460

Pro Thr Glu Lys Gly Lys Ser Arg Ile Pro Arg Leu Ser Lys Ser His
465 470 475 480

Ser
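The relationship between the nucleotide listing of SEQ ID NO:1 and the amino acid listing of SEQ ID NO:2 can be spot-checked by translating the start of the nucleotide sequence with the standard genetic code. The sketch below is illustrative. The function name and the offset of two nucleotides are assumptions, the offset having been inferred by comparing the two listings rather than stated in the document.

```python
BASES = "tcag"
# Standard genetic code, ordered t, c, a, g at each codon position.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna: str, start: int = 0) -> str:
    """Translate dna from a 0-based start offset; unknown codons become 'X'."""
    dna = "".join(dna.lower().split())  # drop the listing's spaces
    codons = (dna[i:i + 3] for i in range(start, len(dna) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)

# First 60 nucleotides of SEQ ID NO:1 as printed in the listing.
seq1_start = "gcacgaggag cagctcccct gcccctcgca gcggctactc tacaggtcta gcgactcttt"
print(translate(seq1_start, start=2))  # TRSSSPAPRSGYSTGLATL
```

In one-letter code this output corresponds to the first nineteen residues listed for SEQ ID NO:2 (Thr Arg Ser Ser Ser Pro Ala Pro Arg Ser Gly Tyr Ser Thr Gly Leu Ala Thr Leu).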



<210> 3 

<211> 1948 

<212> DNA 

<213> Zea mays 

<400> 3 

gtcgacccac gcgtccggac caaagccggg catacgcccc caagtccaaa agccctctcc 60

gccccgctct cccactcgta ggtgttctcc ccgtctccgc ccgcactcgc tcgtccgcgc 120

gcaggaaggt tgacctgtcg agggccggcg aacccggtaa gtaagagtga aaatggatgg 180

aagtagtaaa gggagtgggc attctgaagc attaaggaac tacaacctgg gaagaacttt 240
aggtatcggt acatttggaa aagtgaagat tgcagagcat aagcttactg gacatagggt 300
tgctataaag atcatcaact gccgccaaat gagaaatatg gaaatggaag agaaagcaaa 360
gagagaattc aagatattga agttgttcat tcacccccat atcattcggc tttatgaggt 420
catatacaca cctacagata tatatgttgt gatggaatat tgtaagtatg gcgagttatt 480
tgattacatt gttgagaaag gcagattaca ggaagatgaa gctcgtcgaa tcttccagca 540
gatcatatct ggcgtcgaat actgccatag aaacatggtt gtccaccgtg acctaaagcc 600
ggaaaacttg ttacttgatt caaagtataa tgtaaaactt gcggattttg gtctgagcaa 660
tgtcatgcat gatggccatt ttctgaagac tagctgtggg agtccgaact atgctgctcc 720
agaggtaata tctggtaaac tatatgctgg acctgaggtc gatgtatgga gttgtggggt 780
gattctttat gctcttcttt gtggaactct tccatttgat gatgagaata ttcccaatct 840
gttcaaaaaa attaagggag gtatctacac acttccaagt catttgtctg ctttggccag 900
ggatttgatc ccacgaatgc ttgttgttga gcctatgaag agaatcacaa ttagggaaat 960

tcgggagcat caatggttcc agattcgcct tccacgttac ttggcagtgc ctccaccaga 1020

tacgacacaa caagccaaaa tgattgatga agatacactt cgagatgttg ttaatatggg 1080

atttaacaag aaccatgtgt gtgaatcact gtgcagcaga cttcaaaatg aggcaactgt 1140

tgcatattat ttactattgg acaatcggtt tagagcaact agtggctatc ttggggcaga 1200

ttatcaagaa tcaatggaca ggaatttaaa ccagctggcg tcatctgaat catctagttc 1260

tggtacgagg aattatgttc caggaagcag tgatcctcat agcagtggtt tgcggccata 1320 

ttatcctgtt gaaagaaaat gggcgcttgg acttcagtct cgggcccacc ctcgtgaaat 1380 

aatggttgag gtcttaaaag cacttcaaga attaaacgtc agatggaaga agaatgggca 1440

ctacaacgtg aaatgcagat ggtgcccagg gtttcctgaa gttaatgaca cgttagatgc 1500 

cagcaacagc tttcttggtg actctaccat catggataat gatgatgcta atgggaggct 1560 

acctactgtg atcaagtttg aattccagct ttacaagacg aaggacgaca agtacctctt 1620 

agatatgcag agagttactg gacctcagct gctcttcctt gacttctgtg cggccttcct 1680

taccaagctt agggttctat agtggtctac catgtgcaaa ttttcactgt ggtgatgaat 1740 

aaccgaaagc atgtaaatag gaaccttgtt ctcgtctttt ggacaacgaa catgtttgag 1800 

tgactggtct tgtgttgagc gcgtaaaggt catgtatact taggttagta ctattttcgt 1860 

tcttaaatat ttgtcgtctg ctagtgatag ttcatttttg aactaaaacg ttacgaataa 1920 

aaaaagagta aaaaaaaaaa aaaaaaaa 1948 

<210> 4 

<211> 509 

<212> PRT 

<213> Zea mays 

<400> 4 

Met Asp Gly Ser Ser Lys Gly Ser Gly His Ser Glu Ala Leu Arg Asn
1 5 10 15

Tyr Asn Leu Gly Arg Thr Leu Gly Ile Gly Thr Phe Gly Lys Val Lys
20 25 30

Ile Ala Glu His Lys Leu Thr Gly His Arg Val Ala Ile Lys Ile Ile
35 40 45

Asn Cys Arg Gln Met Arg Asn Met Glu Met Glu Glu Lys Ala Lys Arg
50 55 60

Glu Phe Lys Ile Leu Lys Leu Phe Ile His Pro His Ile Ile Arg Leu
65 70 75 80

Tyr Glu Val Ile Tyr Thr Pro Thr Asp Ile Tyr Val Val Met Glu Tyr
85 90 95

Cys Lys Tyr Gly Glu Leu Phe Asp Tyr Ile Val Glu Lys Gly Arg Leu
100 105 110

Gln Glu Asp Glu Ala Arg Arg Ile Phe Gln Gln Ile Ile Ser Gly Val
115 120 125

Glu Tyr Cys His Arg Asn Met Val Val His Arg Asp Leu Lys Pro Glu
130 135 140

Asn Leu Leu Leu Asp Ser Lys Tyr Asn Val Lys Leu Ala Asp Phe Gly
145 150 155 160

Leu Ser Asn Val Met His Asp Gly His Phe Leu Lys Thr Ser Cys Gly
165 170 175

Ser Pro Asn Tyr Ala Ala Pro Glu Val Ile Ser Gly Lys Leu Tyr Ala
180 185 190

Gly Pro Glu Val Asp Val Trp Ser Cys Gly Val Ile Leu Tyr Ala Leu
195 200 205

Leu Cys Gly Thr Leu Pro Phe Asp Asp Glu Asn Ile Pro Asn Leu Phe
210 215 220

Lys Lys Ile Lys Gly Gly Ile Tyr Thr Leu Pro Ser His Leu Ser Ala
225 230 235 240

Leu Ala Arg Asp Leu Ile Pro Arg Met Leu Val Val Glu Pro Met Lys
245 250 255

Arg Ile Thr Ile Arg Glu Ile Arg Glu His Gln Trp Phe Gln Ile Arg
260 265 270

Leu Pro Arg Tyr Leu Ala Val Pro Pro Pro Asp Thr Thr Gln Gln Ala
275 280 285

Lys Met Ile Asp Glu Asp Thr Leu Arg Asp Val Val Asn Met Gly Phe
290 295 300

Asn Lys Asn His Val Cys Glu Ser Leu Cys Ser Arg Leu Gln Asn Glu
305 310 315 320

Ala Thr Val Ala Tyr Tyr Leu Leu Leu Asp Asn Arg Phe Arg Ala Thr
325 330 335

Ser Gly Tyr Leu Gly Ala Asp Tyr Gln Glu Ser Met Asp Arg Asn Leu
340 345 350

Asn Gln Leu Ala Ser Ser Glu Ser Ser Ser Ser Gly Thr Arg Asn Tyr
355 360 365

Val Pro Gly Ser Ser Asp Pro His Ser Ser Gly Leu Arg Pro Tyr Tyr
370 375 380

Pro Val Glu Arg Lys Trp Ala Leu Gly Leu Gln Ser Arg Ala His Pro
385 390 395 400

Arg Glu Ile Met Val Glu Val Leu Lys Ala Leu Gln Glu Leu Asn Val
405 410 415

Arg Trp Lys Lys Asn Gly His Tyr Asn Val Lys Cys Arg Trp Cys Pro
420 425 430

Gly Phe Pro Glu Val Asn Asp Thr Leu Asp Ala Ser Asn Ser Phe Leu
435 440 445

Gly Asp Ser Thr Ile Met Asp Asn Asp Asp Ala Asn Gly Arg Leu Pro
450 455 460

Thr Val Ile Lys Phe Glu Phe Gln Leu Tyr Lys Thr Lys Asp Asp Lys
465 470 475 480

Tyr Leu Leu Asp Met Gln Arg Val Thr Gly Pro Gln Leu Leu Phe Leu
485 490 495

Asp Phe Cys Ala Ala Phe Leu Thr Lys Leu Arg Val Leu
500 505

<210> 5 
<211> 2107 
<212> DNA
<213> Zea mays 

<400> 5 

gtcgacccac gcgtccgccc atgtgagcct cgcttttctc ctctctctct ccccggcatt 60 

tctctctctg gcgtcactcg cgtgtcaggc gcattctgcc tctttctttc tccccctccc 120 

ctcccctccc ctgcgggctt ctctcacgag ctcccccggc ctcgccgccg ccgcctgctc 180 

cggcgggcgg gacccagctc gcagcgtctc ccctgcggcg aggtacacga tggagggagc 240 

gggaagagat gccaaccctt tgagcggtta cagaattggc aaaaccctgg gaattgggtc 300 

gttcggtaaa gtgaagatcg cggaacatat attgactggt cataaggtgg cgatcaagat 360 

tctcaatcgc aagaagatca gaagcatgga tatggaagag aaagttaaga gagaaatcaa 420 

gatactgaga ttatttatgc atcctcatat catacgcctt tatgaggtga tagatacacc 480 

tgctgatatc tgtgttgtta tggagtatgt taaatctgga gagttgtttg attacatcgt 540 

tgagaaggga aggctacacg aagaggaagc ccgacacttt tttcagcaga tcatatctgg 600 

tgttgaatat tgccatagga acatggttgc tcaccgtgat ttaaagccag agaatcttct 660 

tttggattca aaatgcaatg ttaagattgc cgattttggc ttaagtaata ttatgcgtga 720 

tggtcacttt cttaagacga gttgtggtag cccgaattat gcagcacctg aggtcatatc 780 

tggtaaacta tatgctggtc ctgaagttga cgtctggagc tgtggagtta ttctttatgc 840 

tcttctttgt ggcactctcc catttgacga tgagaatatt ccaaaccttt tcaagaaaat 900

aaagggtgga atatataccc ttcctagtca tttgtcacct tcagcgaggg acttgattcc 960
cagaatgctg gttgttgatc caatgaaaag gattacaata cgtgaaatcc gtgaacatgt 1020

gtggttcaag atccgacttc cgcgctattt ggctgtgccg cctccagaca ctgctcaaca 1080 

agttaaaaag gtcgacgagg aaactcttaa tgatgttatt aagatgggtt ttgacaagaa 1140 

tcagctaatt gaatctctgc aaaacagatt gcagaatgag gcaacagttg cctattattt 1200 

actcttggac aataggcttc gtacaaccag tggttatctt ggatctgagt ttcaagaatc 1260 

tatggactca tctttgtctc aagtaatcgc tgaaacacca acttcagcaa ctgaacttcg 1320 

tcagcatggg ttttcagaat ctccaggttc tggcttgagg cagcattttg cagctgaaag 1380

gaaatgggcc cttggtcttc agtctcgagc acatccacga gaaataataa gtgaagtgct 1440

taaagctctg caagaactga atgtttactg gaaaaagatt ggacactaca acatgaaatg 1500

cagatggagt cctggctgcc ttgagagtat gatgcataac agtgatagct tcagtgcgga 1560 

gtctgctata attgaaactg atgttttcat ggagaaatca accccgacag tgaagtttga 1620 

gattcagctt tacaaaacga gggatgagaa gtaccttctt gacctgcaaa gggtcagtgg 1680

atcacatctt ctctttctgg acttgtgttc cgcctttcta actcagctga gagttctttg 1740 

agcctgacat gattggcttc ccaagcactc cttgatgcac ccagtggagt ttcaagttac 1800 

atttgagatg tacataatga ccatatataa tcctccagta gattatattt tctggagcat 1860

gtaatgtaga cgttagaggc ttgtcctggt ttgccaagat ggttgttacc taaggtccta 1920

acaagaggcc gcaggcgagc ccagtccaat tagatgatgt gaaaaatttt tggccttttt 1980
ttagtctttg gacatgttat caaattaagg aggtcgtgct agcacgatac ccaaactaag 2040
aaatacccca ttgaaaactg aaaacaaagc ttgctcgttc aaaaaaaaaa aaaaaaaaaa 2100

aaaaaaa 2107

<210> 6 
<211> 579 
<212> PRT 
<213> Zea mays 

<400> 6 

Ser Thr His Ala Ser Ala His Val Ser Leu Ala Phe Leu Leu Ser Leu
1 5 10 15

Ser Pro Ala Phe Leu Ser Leu Ala Ser Leu Ala Cys Gln Ala His Ser
20 25 30

Ala Ser Phe Phe Leu Pro Leu Pro Ser Pro Pro Leu Arg Ala Ser Leu
35 40 45

Thr Ser Ser Pro Gly Leu Ala Ala Ala Ala Cys Ser Gly Gly Arg Asp
50 55 60

Pro Ala Arg Ser Val Ser Pro Ala Ala Arg Tyr Thr Met Glu Gly Ala
65 70 75 80

Gly Arg Asp Ala Asn Pro Leu Ser Gly Tyr Arg Ile Gly Lys Thr Leu
85 90 95

Gly Ile Gly Ser Phe Gly Lys Val Lys Ile Ala Glu His Ile Leu Thr
100 105 110

Gly His Lys Val Ala Ile Lys Ile Leu Asn Arg Lys Lys Ile Arg Ser
115 120 125

Met Asp Met Glu Glu Lys Val Lys Arg Glu Ile Lys Ile Leu Arg Leu
130 135 140

Phe Met His Pro His Ile Ile Arg Leu Tyr Glu Val Ile Asp Thr Pro
145 150 155 160

Ala Asp Ile Cys Val Val Met Glu Tyr Val Lys Ser Gly Glu Leu Phe
165 170 175

Asp Tyr Ile Val Glu Lys Gly Arg Leu His Glu Glu Glu Ala Arg His
180 185 190

Phe Phe Gln Gln Ile Ile Ser Gly Val Glu Tyr Cys His Arg Asn Met
195 200 205

Val Ala His Arg Asp Leu Lys Pro Glu Asn Leu Leu Leu Asp Ser Lys
210 215 220

Cys Asn Val Lys Ile Ala Asp Phe Gly Leu Ser Asn Ile Met Arg Asp
225 230 235 240

Gly His Phe Leu Lys Thr Ser Cys Gly Ser Pro Asn Tyr Ala Ala Pro
245 250 255

Glu Val Ile Ser Gly Lys Leu Tyr Ala Gly Pro Glu Val Asp Val Trp
260 265 270

Ser Cys Gly Val Ile Leu Tyr Ala Leu Leu Cys Gly Thr Leu Pro Phe
275 280 285

Asp Asp Glu Asn Ile Pro Asn Leu Phe Lys Lys Ile Lys Gly Gly Ile
290 295 300

Tyr Thr Leu Pro Ser His Leu Ser Pro Ser Ala Arg Asp Leu Ile Pro
305 310 315 320

Arg Met Leu Val Val Asp Pro Met Lys Arg Ile Thr Ile Arg Glu Ile
325 330 335

Arg Glu His Val Trp Phe Lys Ile Arg Leu Pro Arg Tyr Leu Ala Val
340 345 350

Pro Pro Pro Asp Thr Ala Gln Gln Val Lys Lys Val Asp Glu Glu Thr
355 360 365

Leu Asn Asp Val Ile Lys Met Gly Phe Asp Lys Asn Gln Leu Ile Glu
370 375 380
370 375 380 
Ser Leu Gln Asn Arg Leu Gln Asn Glu Ala Thr Val Ala Tyr Tyr Leu
385 390 395 400

Leu Leu Asp Asn Arg Leu Arg Thr Thr Ser Gly Tyr Leu Gly Ser Glu
405 410 415

Phe Gln Glu Ser Met Asp Ser Ser Leu Ser Gln Val Ile Ala Glu Thr
420 425 430

Pro Thr Ser Ala Thr Glu Leu Arg Gln His Gly Phe Ser Glu Ser Pro
435 440 445

Gly Ser Gly Leu Arg Gln His Phe Ala Ala Glu Arg Lys Trp Ala Leu
450 455 460

Gly Leu Gln Ser Arg Ala His Pro Arg Glu Ile Ile Ser Glu Val Leu
465 470 475 480

Lys Ala Leu Gln Glu Leu Asn Val Tyr Trp Lys Lys Ile Gly His Tyr
485 490 495

Asn Met Lys Cys Arg Trp Ser Pro Gly Cys Leu Glu Ser Met Met His
500 505 510

Asn Ser Asp Ser Phe Ser Ala Glu Ser Ala Ile Ile Glu Thr Asp Val
515 520 525

Phe Met Glu Lys Ser Thr Pro Thr Val Lys Phe Glu Ile Gln Leu Tyr
530 535 540

Lys Thr Arg Asp Glu Lys Tyr Leu Leu Asp Leu Gln Arg Val Ser Gly
545 550 555 560

Ser His Leu Leu Phe Leu Asp Leu Cys Ser Ala Phe Leu Thr Gln Leu
565 570 575

Arg Val Leu



<210> 7 

<211> 530 

<212> DNA 

<213> Oryza sativa 

<220> 

<221> unsure 

<222> (466) 

<220> 

<221> unsure 

<222> (503) 



<220> 

<221> unsure 
<222> (507) 

<400> 7 

gagggtaggg gactacgtgc tggtgcggca gatcgggtcg ggggcgtacg cgcgggtctg 60 

gctcgggaag caccgcacgc ggggcacgga ggtggccttg aaggagatcg ccgtggagcg 120 

gcttagcagc aagctccgcg agagcctcct ctccgaggtc gacatcctcc ggcgcatccg 180 
tcatcccaac gtcatcgccc tccacgagtc catcagggat ggtgggaaaa tatatcttgt 240

attagagtac tgtcgaggtg gtgacttaca ctcatacctt cagcagcata aaagggtttc 300 

tgaaacagtt gctaagcatt tcatccagca gctagcatct ggtctgcaga tgctgcgtga 360 

aaacaacgtg gttcatcgag atctaaaaac cacagaaatt cttctaattg caaataatga 420 

aaatctcccc ttgaagattg cggactttgg atttgcaaat ttttanaacc ttcttytctg 480 

ggctgaaaac actttgcggt tcncgcntta aaagggtcca aaagtcatca 530 
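The three "unsure" feature locations declared for SEQ ID NO:7 can be cross-checked against the printed sequence. The sketch below is illustrative, with a hypothetical function name. It scans the last two printed rows, which cover positions 421 to 530, for the symbol "n" and reports 1-based positions.

```python
def ambiguous_positions(seq: str, first_position: int = 1, symbol: str = "n") -> list:
    """Return 1-based positions of symbol in seq, given the position of its first base."""
    seq = "".join(seq.lower().split())  # drop the listing's spaces
    return [first_position + i for i, base in enumerate(seq) if base == symbol]

# Rows covering positions 421-530 of SEQ ID NO:7, as printed in the listing.
tail = (
    "aaatctcccc ttgaagattg cggactttgg atttgcaaat ttttanaacc ttcttytctg "
    "ggctgaaaac actttgcggt tcncgcntta aaagggtcca aaagtcatca"
)
print(ambiguous_positions(tail, first_position=421))  # [466, 503, 507]
```

The reported positions agree with the <222> locations (466), (503) and (507) given for this sequence.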

<210> 8 

<211> 151 

<212> PRT 

<213> Oryza sativa 



<400> 8 

Val Gly Asp Tyr Val Leu Val Arg Gln Ile Gly Ser Gly Ala Tyr Ala
1 5 10 15

Arg Val Trp Leu Gly Lys His Arg Thr Arg Gly Thr Glu Val Ala Leu
20 25 30

Lys Glu Ile Ala Val Glu Arg Leu Ser Ser Lys Leu Arg Glu Ser Leu
35 40 45

Leu Ser Glu Val Asp Ile Leu Arg Arg Ile Arg His Pro Asn Val Ile
50 55 60

Ala Leu His Glu Ser Ile Arg Asp Gly Gly Lys Ile Tyr Leu Val Leu
65 70 75 80

Glu Tyr Cys Arg Gly Gly Asp Leu His Ser Tyr Leu Gln Gln His Lys
85 90 95

Arg Val Ser Glu Thr Val Ala Lys His Phe Ile Gln Gln Leu Ala Ser
100 105 110

Gly Leu Gln Met Leu Arg Glu Asn Asn Val Val His Arg Asp Leu Lys
115 120 125

Thr Thr Glu Ile Leu Leu Ile Ala Asn Asn Glu Asn Leu Pro Leu Lys
130 135 140

Ile Ala Asp Phe Gly Phe Ala
145 150



<210> 9 

<211> 1848 

<212> DNA 

<213> Glycine max 

<400> 9

ggaatgcaga gaaatgagaa ttgggcattg gctcagtcct tctgaggcag gtggcgctgc 60

caaacgcaga acctccggca gaaattgcca ccgcaaacac gtggaagacg ggtagtagta 120

ctccatctcc cattctctca tattcccttt actaattaaa gatcatgaac agcgagagac 180 

agtcgacgac gacgacgttg ctgcacggca agtacgagct aggtcgtgtg ctggggcacg 240 

gaagcttcgc caaggtctac cacgcgcgga acctgaagac ggggcagcac gtggctatga 300 

aggtcgttgg aaaagaaaag gtgatcaagg tcggaatgat ggagcaggtc aagagggaga 360 

tctcggtcat gaagatggtc aagcacccaa acatcgtcga gctccacgaa gtcatggcca 420 

gtaagtccaa gatctacatc tccatcgaac tcgtccgcgg cggagagctc ttcaacaagg 480 

tctccaaggg acgcttgaag gaggacctgg ccagactcta cttccagcag ttaatctctg 540 

ccgtcgattt ctgccacagc cgcggcgtct accaccgtga cctcaagccg gagaatctcc 600 

tcctggacga acacggcaac ctcaaggtct ccgacttcgg actcaccgcc ttctccgacc 660 
accttaaaga ggacgggctg ttacacacca cgtgcggcac gcctgcgtac gtgtcaccgg 720
aagttattgc aaagaaaggc tatgacggtg ccaaggctga tatatggtca tgcggagtaa 780 
tcctctacgt tctcctagct gggtttttac cctttcagga tgataatttg gttgccatgt 840 
ataaaaaaat ccatagaggg gacttcaagt gcccaccgtg gttctctctc gatgcgagaa 900 
agcttgttac gaaactgctt gatccgaatc cgaatacgag gattagtatt tctaaagtga 960 
tggagtcttc ttggtttaag aaacaggtgc cgaggaaggt ggaggaggtg gttgagaagg 1020 
tcgatttgga ggagaagatt gagaatcagg agacgatgaa tgctttccac ataatctcct 1080 
tgtcggaggg gttcaatttg tcgccgttgt tcgaggagaa gaggaaggag gagatgaggt 1140 
tcgccaccgc ggggacgccg agcagcgtca tttcgcgcct ggaggaggtg gccaaggcgg 1200 
ggaagtttga cgtgaagagt agcgagacga aagtgaggct tcagggacag gagcgtggga 1260 
ggaaggggaa gctagcgatt gcggcggata tctacgccgt gacgccgtcg tttatggtgg 1320 
tggaggtcaa gaaggacaat ggggatacgt tggagtataa ccagttctgc agcaaacaac 1380 
ttcgtcctgc gcttaaggat atcttctgga attctgcacc tgccagtgct tgaatggtat 1440 
attatatata tgatgtctga ttattgtttt acaacaccat ttgtcgacct tgttatttta 1500 
ttttattttt gtttaggaac tttgagtttt gttgataaga tctgtttgtg ttttgttgtt 1560 
taaattgtta gtttggtact tggggactgg ttagtgggta cttactctct ctgtcgccca 1620 
gaattgttgt taagagatgc acacgttgtg gatgattctg tcatgtttgt gtttcaatta 1680 
tgagtgtgta atttggagtg cttgtggggc ttactgcgac tttgttcaaa gttacttttt 1740 
acctttttct tttctttatt ttattccttc tagattatgt cctctgtgtt cacgggtcac 1800 
gcctgtacct aatacactag gaattcgact aaaaaaaaaa aaaaaaaa 1848 

<210> 10 

<211> 422 

<212> PRT 

<213> Glycine max 

<400> 10 

Met Asn Ser Glu Arg Gln Ser Thr Thr Thr Thr Leu Leu His Gly Lys
1 5 10 15

Tyr Glu Leu Gly Arg Val Leu Gly His Gly Ser Phe Ala Lys Val Tyr
20 25 30

His Ala Arg Asn Leu Lys Thr Gly Gln His Val Ala Met Lys Val Val
35 40 45

Gly Lys Glu Lys Val Ile Lys Val Gly Met Met Glu Gln Val Lys Arg
50 55 60

Glu Ile Ser Val Met Lys Met Val Lys His Pro Asn Ile Val Glu Leu
65 70 75 80

His Glu Val Met Ala Ser Lys Ser Lys Ile Tyr Ile Ser Ile Glu Leu
85 90 95

Val Arg Gly Gly Glu Leu Phe Asn Lys Val Ser Lys Gly Arg Leu Lys
100 105 110

Glu Asp Leu Ala Arg Leu Tyr Phe Gln Gln Leu Ile Ser Ala Val Asp
115 120 125

Phe Cys His Ser Arg Gly Val Tyr His Arg Asp Leu Lys Pro Glu Asn
130 135 140

Leu Leu Leu Asp Glu His Gly Asn Leu Lys Val Ser Asp Phe Gly Leu
145 150 155 160

Thr Ala Phe Ser Asp His Leu Lys Glu Asp Gly Leu Leu His Thr Thr
165 170 175

Cys Gly Thr Pro Ala Tyr Val Ser Pro Glu Val Ile Ala Lys Lys Gly
180 185 190

Tyr Asp Gly Ala Lys Ala Asp Ile Trp Ser Cys Gly Val Ile Leu Tyr
195 200 205

Val Leu Leu Ala Gly Phe Leu Pro Phe Gln Asp Asp Asn Leu Val Ala
210 215 220

Met Tyr Lys Lys Ile His Arg Gly Asp Phe Lys Cys Pro Pro Trp Phe
225 230 235 240

Ser Leu Asp Ala Arg Lys Leu Val Thr Lys Leu Leu Asp Pro Asn Pro
245 250 255

Asn Thr Arg Ile Ser Ile Ser Lys Val Met Glu Ser Ser Trp Phe Lys
260 265 270

Lys Gln Val Pro Arg Lys Val Glu Glu Val Val Glu Lys Val Asp Leu
275 280 285

Glu Glu Lys Ile Glu Asn Gln Glu Thr Met Asn Ala Phe His Ile Ile
290 295 300

Ser Leu Ser Glu Gly Phe Asn Leu Ser Pro Leu Phe Glu Glu Lys Arg
305 310 315 320

Lys Glu Glu Met Arg Phe Ala Thr Ala Gly Thr Pro Ser Ser Val Ile
325 330 335

Ser Arg Leu Glu Glu Val Ala Lys Ala Gly Lys Phe Asp Val Lys Ser
340 345 350

Ser Glu Thr Lys Val Arg Leu Gln Gly Gln Glu Arg Gly Arg Lys Gly
355 360 365

Lys Leu Ala Ile Ala Ala Asp Ile Tyr Ala Val Thr Pro Ser Phe Met
370 375 380

Val Val Glu Val Lys Lys Asp Asn Gly Asp Thr Leu Glu Tyr Asn Gln
385 390 395 400

Phe Cys Ser Lys Gln Leu Arg Pro Ala Leu Lys Asp Ile Phe Trp Asn
405 410 415

Ser Ala Pro Ala Ser Ala
420

<210> 11 

<211> 2123 

<212> DNA 

<213> Glycine max 

<400> 11 

cgcgatgctt tgttcctctt ttgttcttcc ccttcccaat cttcaacttt agggttcctt 60 

taatctcgtt attcctcttc actcccaagc tccgttcact ccaacaacac tccgatttag 120 

aaatggatgg accagctggc cgaggtggtg ctggcctgga catgtttcta ccaaattata 180 

aattgggaaa aacactcggg attggatctt ttggcaaggt gaaaattgca gaacatgtgt 240 

tgactggcca taaggttgcg atcaagatcc ttaaccgacg caagataaag aacatggaaa 300 

tggaagaaaa agtgagaaga gaaatcaaaa ttttaagatt gttcatgcat cctcacatta 360 

ttcgacttta tgaagtcata gaaactccaa ctgacatata tgttgtcatg gagtatgtga 420

agtctggaga gcttttcgat tacatagtag agaagggtag gttgcaggaa gatgaagctc 480 

gtaatttttt tcagcagata atctctgggg tggagtactg tcacaggaat atggtggttc 540 

atagagattt gaagcctgag aatttacttt tggactccaa atgtaatgtc aagattgctg 600 

attttggctt gagcaacatc atgcgtgatg gtcactttct taaaacaagt tgtggaagcc 660 

ctaactatgc agctcctgag gttatctctg ggaaattgta tgctggacct gaagtggatg 720 

tctggagctg tggtgtaatt ttatatgccc ttctttgtgg cacccttcct tttgatgatg 780 

aaaatattcc aaatctcttc aagaaaataa agggtgggat ttacactctt cccagtcatc 840 

tatcacccgg tgctagagat ttgataccag ggatgcttgt ggttgaccct atgaggagaa 900 

tgaccatacc tgagatccgt caacacccat ggttccaagc tcgacttcca cgttatttag 960 

ctgtgccacc accagataca atgcaacagg ccaaaaagat tgatgaggag atccttcagg 1020 

aagtggtgaa aatgggattt gacaggaatc aattggttga atctcttggg aacaggatac 1080 

aaaatgaggg tactgtggca tactatttgt tattggacaa ccgatttcgt gtttccagtg 1140 

gctatcttgg agctgagttt caagagacca tggattccgg ttttaatcaa atgcattcca 1200 

gtgaacttgc ttcttcagtt gttggaaacc gctttccagg ctacatggaa tatccaggag 1260 

taggatcgag gcaacagttc cctgttgaaa ggaaatgggc ccttgggctt cagtctcgag 1320 

cccatcctcg tgaaataatg actgaggttc ttaaagcttt gcaagaatta aatgtttgtt 1380 

ggaagaagat tggtcactac aacatgaagt gtaggtgggt tgctggcatt cctggtcacc 1440

acgaaggaat ggttaacaat aatgtgcata gtaatcatta ctttggagat gattccaaca 1500 

ttattgagaa tgatgctgtt tctacttcaa atgtggtcaa gtttgaagtg cagctttaca 1560

aaacccggga agaaaagtat ctgcttgatc ttcaaagggt gcagggtcca cagtttcttt 1620 

tcttggatct atgtgctgct ttccttgcac agcrtcgtgt cctctagagc gaagagctta 1680 

ggctcaccag agacataaat atgccttgtg tttcatgrga ataagcttac attgtacata 1740 

caagtatctt ttcggtccaa gcttatcctc tgttttctac ccgtttatct gtgggtaaac 1800 

attataatga gccaatccca agtatgggat ttgcttcgtt aacaatggtg ctcgactagg 1860 

ttttcctttc tcttggtgat tttaatttta tcagtgtaat gaattatatg gggaatgtcr 1920 

taaaagaatt agccccaact gacagtgata tatgtatggt tatcaaccaa atgctgtaaa 1980 

cgactgttgc cattggaagg gatgtctagg ctgcttctaa cttttgcaga ttagtctttg 2040

ttgtcctata aatgtaattt catattactc gtgtatgaat gctatgtaac gactgtgttc 2100 

atccgaaaaa aaaaaaaaaa aaa 2123 

<210> 12

<211> 514

<212> PRT
<213> Glycine max 

<400> 12 

Met Asp Gly Pro Ala Gly Arg Gly Gly Ala Gly Leu Asp Met Phe Leu
1 5 10 15

Pro Asn Tyr Lys Leu Gly Lys Thr Leu Gly Ile Gly Ser Phe Gly Lys
20 25 30

Val Lys Ile Ala Glu His Val Leu Thr Gly His Lys Val Ala Ile Lys
35 40 45

Ile Leu Asn Arg Arg Lys Ile Lys Asn Met Glu Met Glu Glu Lys Val
50 55 60

Arg Arg Glu Ile Lys Ile Leu Arg Leu Phe Met His Pro His Ile Ile
65 70 75 80

Arg Leu Tyr Glu Val Ile Glu Thr Pro Thr Asp Ile Tyr Val Val Met
85 90 95

Glu Tyr Val Lys Ser Gly Glu Leu Phe Asp Tyr Ile Val Glu Lys Gly
100 105 110

Arg Leu Gln Glu Asp Glu Ala Arg Asn Phe Phe Gln Gln Ile Ile Ser
115 120 125

Gly Val Glu Tyr Cys His Arg Asn Met Val Val His Arg Asp Leu Lys
130 135 140

Pro Glu Asn Leu Leu Leu Asp Ser Lys Cys Asn Val Lys Ile Ala Asp
145 150 155 160

Phe Gly Leu Ser Asn Ile Met Arg Asp Gly His Phe Leu Lys Thr Ser
165 170 175

Cys Gly Ser Pro Asn Tyr Ala Ala Pro Glu Val Ile Ser Gly Lys Leu
180 185 190

Tyr Ala Gly Pro Glu Val Asp Val Trp Ser Cys Gly Val Ile Leu Tyr
195 200 205

Ala Leu Leu Cys Gly Thr Leu Pro Phe Asp Asp Glu Asn Ile Pro Asn
210 215 220

Leu Phe Lys Lys Ile Lys Gly Gly Ile Tyr Thr Leu Pro Ser His Leu
225 230 235 240

Ser Pro Gly Ala Arg Asp Leu Ile Pro Gly Met Leu Val Val Asp Pro
245 250 255

Met Arg Arg Met Thr Ile Pro Glu Ile Arg Gln His Pro Trp Phe Gln
260 265 270

Ala Arg Leu Pro Arg Tyr Leu Ala Val Pro Pro Pro Asp Thr Met Gln
275 280 285

Gln Ala Lys Lys Ile Asp Glu Glu Ile Leu Gln Glu Val Val Lys Met
290 295 300

Gly Phe Asp Arg Asn Gln Leu Val Glu Ser Leu Gly Asn Arg Ile Gln
305 310 315 320

Asn Glu Gly Thr Val Ala Tyr Tyr Leu Leu Leu Asp Asn Arg Phe Arg
325 330 335

Val Ser Ser Gly Tyr Leu Gly Ala Glu Phe Gln Glu Thr Met Asp Ser
340 345 350

Gly Phe Asn Gln Met His Ser Ser Glu Leu Ala Ser Ser Val Val Gly
355 360 365

Asn Arg Phe Pro Gly Tyr Met Glu Tyr Pro Gly Val Gly Ser Arg Gln
370 375 380

Gln Phe Pro Val Glu Arg Lys Trp Ala Leu Gly Leu Gln Ser Arg Ala
385 390 395 400

His Pro Arg Glu Ile Met Thr Glu Val Leu Lys Ala Leu Gln Glu Leu
405 410 415

Asn Val Cys Trp Lys Lys Ile Gly His Tyr Asn Met Lys Cys Arg Trp
420 425 430

Val Ala Gly Ile Pro Gly His His Glu Gly Met Val Asn Asn Asn Val
435 440 445

His Ser Asn His Tyr Phe Gly Asp Asp Ser Asn Ile Ile Glu Asn Asp
450 455 460

Ala Val Ser Thr Ser Asn Val Val Lys Phe Glu Val Gln Leu Tyr Lys
465 470 475 480

Thr Arg Glu Glu Lys Tyr Leu Leu Asp Leu Gln Arg Val Gln Gly Pro
485 490 495

Gln Phe Leu Phe Leu Asp Leu Cys Ala Ala Phe Leu Ala Gln Leu Arg
500 505 510

Val Leu



<210> 13 

<211> 2040 

<212> DNA 

<213> Glycine max 



<400> 13 

acttagtgct attcaataat tacgaaaaac caacagcgag tgttgttttt gtttggtcca 60
ggctggacca gggaatgaat agagatgaga ggctttcggt tgaagagaac tgagttaagc 120
ggcgaagcca aacgcagaat cggcggcaga aattgccacc ggaagcacgt cgaagacggg 180
tgaaccaaac acaaaattcc atccaccact tctcttttca atccccaact caaatctgtg 240
aagaaaatct caatttctaa attgtaattc ttagcactac tgctagtagt agtaaaaaaa 300
aacaattaga aagtagtaga gtgttttttt ttcttttttt cgtttgtttt acttttaaga 360
tgggtgagaa gagcaacgtt ggcggcgacg cgattaacac gacactgctt cacgggaagt 420
acgagctggg ccggctgtta ggccacggaa ccttcgcgaa ggtgtaccac gcgcgccacc 480
tgaagacggg aaagagcgtg gcgatgaagg tggtgggcaa agagaaggtg gtgaaggtag 540
gcatgatgga gcagatcaag agagagatct cagccatgaa catggtgaag cacccaaaca 600
tcgtgcagct tcacgaggtc atggcgagca agtccaagat ctacatcgcc atggagctcg 660
tccgcggcgg cgagctcttc aacaagatcg ccagaggccg tctccgggag gagatggcca 720
gactttactt ccaacaactc atctctgctg tggacttctg ccacagccgc ggcgtctacc 780
accgtgacct caagccagag aaccttctcc tcgacgacga cggcaacctc aaggtcaccg 840
acttcggcct cagcactttc tccgagcacc tccggcacga cggcctgctg cacactacgt 900
gcggcacgcc ggcctatgtc gcgcccgagg tgattgggaa aagaggctac gacggcgcca 960
aggccgatar ttggtcttgt ggcgtaatcc tctacgtact tcttgctggt tttttaccct 1020
tccaggatga caacctggtg gccttgtaca agaaaattta ccgcggtgac ttcaagtgtc 1080
cgccgtggtt ctcctccgag gcgcggaggc tcatcaccaa gctcctcgac cccaacccga 1140
acacgcgaat cacaatttca aaaatcatgg actcgtcgtg gttcaagaaa cccgtgccga 1200
agaacttgat gggaaagaaa agagaggaat tggatctgga ggagaagatt aaacagcatg 1260
agcaggaggt ttctactacg atgaatgctt ttcacatcat ttcgctctcg gaggggttcg 1320
atttgtcgcc gttgttcgag gaaaagaaga gagaagagaa agagttacgg tttgcgacca 1380
cgcgccccgc gagcagcgtg atttcgagat tggaagattt ggcgaaggcg gtgaagttcg 1440
acgtgaagaa gagcgagact aaggtgaggt tgcagggtca agaaaaaggg cgaaaaggta 1500
aactcgctat tgctgcggac ttgtacgccg tgacgccgtc gtttttggtg gtggaggtta 1560
agaaggacaa cggtgacacc ttggagtata accagttctg tagcaaggag cttcgtccag 1620
ctcttaaaga tatcgtgtgg aggacttctc ctgcagagaa tcctacactc gcttgaagaa 1680
caaccaagac tcccatgcta attgtcgttc ctttgtagtt gtttttgtta attcttgttt 1740
caggactgag atattttagt tatttgcttg cttctttgtt atgttttggc ctctggggtg 1800
tgtcgtcttt taccccgaaa ggggaattgt tgaaaaggcg aatgcagtaa tgaaccgaaa 1860
tggacatcga ggagttgttg tcgatgatga ttcaggtaca tgtgctgcgt gttgttgtgc 1920
tcgtgggtga tgttgttgct atgttggcgt gattgtgaat atgtttgcaa attgattggt 1980
gtggctcatg ggtcattcat ggctgcgttt aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2040



<210> 14 
<211> 438 
<212> PRT

<213> Glycine max 

<400> 14 

Met Gly Glu Lys Ser Asn Val Gly Gly Asp Ala Ile Asn Thr Thr Leu
1 5 10 15

Leu His Gly Lys Tyr Glu Leu Gly Arg Leu Leu Gly His Gly Thr Phe
20 25 30

Ala Lys Val Tyr His Ala Arg His Leu Lys Thr Gly Lys Ser Val Ala
35 40 45

Met Lys Val Val Gly Lys Glu Lys Val Val Lys Val Gly Met Met Glu
50 55 60

Gln Ile Lys Arg Glu Ile Ser Ala Met Asn Met Val Lys His Pro Asn
65 70 75 80

Ile Val Gln Leu His Glu Val Met Ala Ser Lys Ser Lys Ile Tyr Ile
85 90 95

Ala Met Glu Leu Val Arg Gly Gly Glu Leu Phe Asn Lys Ile Ala Arg
100 105 110

Gly Arg Leu Arg Glu Glu Met Ala Arg Leu Tyr Phe Gln Gln Leu Ile
115 120 125

Ser Ala Val Asp Phe Cys His Ser Arg Gly Val Tyr His Arg Asp Leu
130 135 140

Lys Pro Glu Asn Leu Leu Leu Asp Asp Asp Gly Asn Leu Lys Val Thr
145 150 155 160

Asp Phe Gly Leu Ser Thr Phe Ser Glu His Leu Arg His Asp Gly Leu
165 170 175

Leu His Thr Thr Cys Gly Thr Pro Ala Tyr Val Ala Pro Glu Val Ile
180 185 190

Gly Lys Arg Gly Tyr Asp Gly Ala Lys Ala Asp Ile Trp Ser Cys Gly
195 200 205

Val Ile Leu Tyr Val Leu Leu Ala Gly Phe Leu Pro Phe Gln Asp Asp
210 215 220

Asn Leu Val Ala Leu Tyr Lys Lys Ile Tyr Arg Gly Asp Phe Lys Cys
225 230 235 240

Pro Pro Trp Phe Ser Ser Glu Ala Arg Arg Leu Ile Thr Lys Leu Leu
245 250 255

Asp Pro Asn Pro Asn Thr Arg Ile Thr Ile Ser Lys Ile Met Asp Ser
260 265 270

Ser Trp Phe Lys Lys Pro Val Pro Lys Asn Leu Met Gly Lys Lys Arg
275 280 285

Glu Glu Leu Asp Leu Glu Glu Lys Ile Lys Gln His Glu Gln Glu Val
290 295 300

Ser Thr Thr Met Asn Ala Phe His Ile Ile Ser Leu Ser Glu Gly Phe
305 310 315 320

Asp Leu Ser Pro Leu Phe Glu Glu Lys Lys Arg Glu Glu Lys Glu Leu
325 330 335

Arg Phe Ala Thr Thr Arg Pro Ala Ser Ser Val Ile Ser Arg Leu Glu
340 345 350

Asp Leu Ala Lys Ala Val Lys Phe Asp Val Lys Lys Ser Glu Thr Lys
355 360 365

Val Arg Leu Gln Gly Gln Glu Lys Gly Arg Lys Gly Lys Leu Ala Ile
370 375 380

Ala Ala Asp Leu Tyr Ala Val Thr Pro Ser Phe Leu Val Val Glu Val
385 390 395 400

Lys Lys Asp Asn Gly Asp Thr Leu Glu Tyr Asn Gln Phe Cys Ser Lys
405 410 415

Glu Leu Arg Pro Ala Leu Lys Asp Ile Val Trp Arg Thr Ser Pro Ala
420 425 430

Glu Asn Pro Thr Leu Ala
435

<210> 15 
<211> 2543
<212> DNA

<213> Glycine max
<400> 15 

caatgctctc ttcttccatt ttccttctca atccaccact ctaaccctcg cacgcttcgt 60 
tcaattacaa aaatggacag atcaactggc cgtggtggtg gtggaagtgt ggacatgttt 120 
ctccgaaatt ataagttggg aaaaacactc ggcattgggt cctttggcaa ggtgaaaatt 180 
gctgagcatg tacggactgg tcataaagtt gctataaaga tccttaaccg ccacaagatt 240
aaaaacatgg aaatggaaga aaaagttaga agagaaatca aaattttaag attgtttatg 300
catcatcaca ttataagact atatgaggtt gtagaaaccc caacagacat atatgttgtt 360 
atggagtatg tgaaatctgg agagctcttt gattacatag tagagaaggg tcggctgcaa 420 
gaggatgaag cccgtcattt ttttcagcag ataatttctg gtgtggagta ctgtcacagg 480 
aatatggtgg ttcatagaga cctgaagcct gagaatttac tcttggactc aaaatttaac 540 
atcaagattg ctgattttgg gttgagcaac atcatgcgtg atggtcactt tcttaagaca 600 
agttgtggaa gccctaatta tgcggctcca gaggttatct ctggaaaatt gtatgctgga 660 
ccagaagtag atgtctggag ctgtggtgta attttatatg ctcttctctg tggcactctt 720 
ccttttgatg atgaaaacat tcccaatctc ttcaaaaaaa taaagggtgg gatatacact 780 
cttcctagtc atctatcacc tggtgctaga gatttgatac caaggatgct tgtggtggat 840 
cccatgaaga ggatgaccat acctgagata cgccaacacc catggttcca agttcatcta 900 
ccgcgttatt tagcagtgcc accaccagat acactgcaac aagccaaaaa gattgatgag 960 
gagattcttc aggaagtggt taatatggga tttgacagga atcaattggt tgaatctctt 1020 
agcaacagga tacaaaatga gggtactgta acatactatt tgttattgga caaccggttt 1080 
cgtgtttcta gtggttatct tggagctgaa tttcaagaga caatggattc tggttttaac 1140 
cgtatgcatt ccggcgaagt tgcttctcca gttgttggac accacagcac agggtatatg 1200 
gattatcaag gggtaggaat gcggcaacag ttccctgttg agagaaaatg ggcccttggg 1260 
cttcagtctc gagcccaacc acgtgaaata atgactgagg tccttaaagc tctacaagaa 1320 
ttaaatgttt gttggaagaa gattggacac tataacatga agtgcagatg ggttgctggc 1380 
actgctggtc atcatgaagg aatgattaac aattctctgc atagtaatca ttactttgga 1440 
aatgattccg gcattattga aaatgaagct gtttctaagt caaatgtggt caagtttgaa 1500 
gtgcagcttt acaaaactcg tgaggagaaa tatctgcttg atcttcaaag ggtccagggc 1560 
ccacagtttc ttttcttgga tctgtgcgct gctttccttt cacagctacg tgttctctag 1620
tgaagctcac aagtcgcaaa ggagaccttg aggtgtcttg tgtggcatgt gaataagctt 1680
acgttgtaca tatcaatgcc ctttcagtat gggtatatct atctgtttgt actttttagc 1740
atacccccag tttttatggg ttatcatgat tggggaaacc taaatatgga atttgctttc 1800
cttaatcatg gtgtctgact aggtctgtct ttttaaaagt gaattgaatt ttcttgatcc 1860
cgtgaaatcc ctgaggattt tatgaaagca ttcgcctcca acagagacat ccatttatgg 1920
tattgaattt tgtgttgtaa ttgactactt atgtctgtca cttggaagcc gaggggttgt 1980
ttgcctaggc tgcatcgaac atgagcttgc agataagcat tttctccccg cttccctact 2040
taattactgc ctcctcttat ggatgcatgt tatttgacaa aaaaaaaaat ctgaatggaa 2100
aagaaaagcg tcaccgtgca gatccgtgat tgagcaaaac gacgtcgtgc tgctacgcat 2160
cgcatcgaag ttgcaacaat ggcgagttcc gaatcgcaag agcatcatcc atggatctcg 2220
aatgcagttc ctttgttggt ggttcttcta attgcacttc acgtcttcgc tttggtgtat 2280
tggatttata gattggccac tgacaataag ccacaacagc agcagcagct acaacaacag 2340
caacagcaac atcaacagag aacaaaggct cactgaccga caacaacaac aacaacaaca 2400
acaaacgttc tcattcaatt tcattttctt caaacaattg ttgtatgaaa ttgttaattg 2460
tgtgcagtaa aggatatgat ttttttgttt ttttggtata acagtgatga atgaagtttt 2520
gtttaatttt taaaaaaaaa aaa 2543



<210> 16 

<211> 515 

<212> PRT 

<213> Glycine max

<400> 16

Met Asp Arg Ser Thr Gly Arg Gly Gly Gly Gly Ser Val Asp Met Phe
1 5 10 15

Leu Arg Asn Tyr Lys Leu Gly Lys Thr Leu Gly Ile Gly Ser Phe Gly
20 25 30

Lys Val Lys Ile Ala Glu His Val Arg Thr Gly His Lys Val Ala Ile
35 40 45

Lys Ile Leu Asn Arg His Lys Ile Lys Asn Met Glu Met Glu Glu Lys
50 55 60

Val Arg Arg Glu Ile Lys Ile Leu Arg Leu Phe Met His His His Ile
65 70 75 80

Ile Arg Leu Tyr Glu Val Val Glu Thr Pro Thr Asp Ile Tyr Val Val
85 90 95

Met Glu Tyr Val Lys Ser Gly Glu Leu Phe Asp Tyr Ile Val Glu Lys
100 105 110

Gly Arg Leu Gln Glu Asp Glu Ala Arg His Phe Phe Gln Gln Ile Ile
115 120 125

Ser Gly Val Glu Tyr Cys His Arg Asn Met Val Val His Arg Asp Leu
130 135 140

Lys Pro Glu Asn Leu Leu Leu Asp Ser Lys Phe Asn Ile Lys Ile Ala
145 150 155 160

Asp Phe Gly Leu Ser Asn Ile Met Arg Asp Gly His Phe Leu Lys Thr
165 170 175

Ser Cys Gly Ser Pro Asn Tyr Ala Ala Pro Glu Val Ile Ser Gly Lys
180 185 190

Leu Tyr Ala Gly Pro Glu Val Asp Val Trp Ser Cys Gly Val Ile Leu
195 200 205

Tyr Ala Leu Leu Cys Gly Thr Leu Pro Phe Asp Asp Glu Asn Ile Pro
210 215 220

Asn Leu Phe Lys Lys Ile Lys Gly Gly Ile Tyr Thr Leu Pro Ser His
225 230 235 240

Leu Ser Pro Gly Ala Arg Asp Leu Ile Pro Arg Met Leu Val Val Asp
245 250 255

Pro Met Lys Arg Met Thr Ile Pro Glu Ile Arg Gln His Pro Trp Phe
260 265 270

Gln Val His Leu Pro Arg Tyr Leu Ala Val Pro Pro Pro Asp Thr Leu
275 280 285

Gln Gln Ala Lys Lys Ile Asp Glu Glu Ile Leu Gln Glu Val Val Asn
290 295 300

Met Gly Phe Asp Arg Asn Gln Leu Val Glu Ser Leu Ser Asn Arg Ile
305 310 315 320

Gln Asn Glu Gly Thr Val Thr Tyr Tyr Leu Leu Leu Asp Asn Arg Phe
325 330 335

Arg Val Ser Ser Gly Tyr Leu Gly Ala Glu Phe Gln Glu Thr Met Asp
340 345 350

Ser Gly Phe Asn Arg Met His Ser Gly Glu Val Ala Ser Pro Val Val
355 360 365

Gly His His Ser Thr Gly Tyr Met Asp Tyr Gln Gly Val Gly Met Arg
370 375 380

Gln Gln Phe Pro Val Glu Arg Lys Trp Ala Leu Gly Leu Gln Ser Arg
385 390 395 400

Ala Gln Pro Arg Glu Ile Met Thr Glu Val Leu Lys Ala Leu Gln Glu
405 410 415

Leu Asn Val Cys Trp Lys Lys Ile Gly His Tyr Asn Met Lys Cys Arg
420 425 430

Trp Val Ala Gly Thr Ala Gly His His Glu Gly Met Ile Asn Asn Ser
435 440 445

Leu His Ser Asn His Tyr Phe Gly Asn Asp Ser Gly Ile Ile Glu Asn
450 455 460

Glu Ala Val Ser Lys Ser Asn Val Val Lys Phe Glu Val Gln Leu Tyr
465 470 475 480

Lys Thr Arg Glu Glu Lys Tyr Leu Leu Asp Leu Gln Arg Val Gln Gly
485 490 495

Pro Gln Phe Leu Phe Leu Asp Leu Cys Ala Ala Phe Leu Ser Gln Leu
500 505 510

Arg Val Leu
515



<210> 17 

<211> 1869 

<212> DNA 

<213> Glycine max 



<400> 17 

gcacgaggtc tggttgcata gcattggttg gtagttgtct caaaaatctc ttcttgccct 60 
ttggccataa tcaaaagcca agacactgtt catacagctg ctcaattatc aagccaacct 120 
tgctcggttc cactgcagaa tttcagttta ttcttatcta gctcaattct ggttgtgggt 180 
ttatctctta ctggaagaca gactttgagg tagactcctt ataagtgcgc agaagttcaa 240 
gtgtagagaa tgagtcagcc taagattaaa cgccgagttg gtaaatacga ggtggggagg 300 
accattggtg aaggtacatt tgcaaaggtg aaatttgcaa ggaactctga gacaggagag 360 
cccgtggctc ttaaaattct tgacaaggag aaggtgctaa agcacaagat ggctgagcag 420 
atcaggagag aagtagctac aatgaaacta atcaagcatc caaatgttgt tcgattgtat 480 
gaggtcatgg gaagcaagac caaaatatat artgttttgg agtttgtaac tgggggggaa 540 
ctctttgaca aaattgtaaa ccatggaagg atgagtgaaa atgaagcacg tagatatttc 600 
cagcagctta taaatgctgt tgattattgc catagcaggg gtgtctacca cagagacctg 660 
aagccagaaa atttgctatt agatacttat gggaacctta aagtttctga ttttggtttg 720 
agtgccctct cccagcaagt tagggatgat ggacttcttc atacracatg tggcactcca 780 
aattatgttg ctcctgaggt ccttaacgat agaggctatg atggggcaac tgcagacttg 840 
tggtcatgtg gggttattct ctttgtattg gttgcaggtt acttgccttt cgacgaccct 900 
aatcttatga acctgtataa aaagatctca gctgctgaat ttacttgccc cccatggctt 960 
tctttcactg ccaggaaatt gattacacga atcttggatc cagatcccac cactcgtatc 1020 
actatacctg agattttgga tgatgaatgg tttaagaaag aatataagcc tcccattttt 1080 
gaggagaatg gggaaatcaa cctcgatgat gttgaagctg tctttaaaga ctctgaagag 1140
caccatgtga cagagaaaaa agaagagcag cctacagcca tgaatgcatt tgagttaatc 1200 
tccatgtcca aaggactgaa ccttgaaaac ttgtttgata ctgagcaggg atttaaaagg 1260 
gaaacaagat tcacctcaaa atcccctgcg gatgagataa tcaacaagat tgaggaagcc 1320 
gcaaaacctc ttggctttga tgtgcagaag aaaaattaca agatgaggct tgcaaatgtg 1380 
aaagctggaa ggaagggaaa ccttaatgtt gccacagaga tatttcaagt ggcaccttct 1440 
cttcacatgg tagaggtacg gaaggcaaaa ggagatacat tggagttcca taagttctac 1500 
aagaaacttt caacaagcct ggatgatgtt gtttggaaaa cagaagatga tatgcaaatg 1560 
cgagaaacaa agtgatgtgg atattattat cattgtctat taagtgtaat tttcttcgtg 1620 
tctgaggttt tactattttc caatttcttc attcgttata ttcctccccc gtaggtttgt 1680 
ttggacatta attacatagt actcatttat tgcataccat gctattattt tttgaaagca 1740 
tgcagagttc atgtaagaat tttactcatc caacagtcgc ggttatgttc atgaaacaaa 1800 
aaattgtaag aaatttgtat attgtatata tctatctatt tatatctttt caaaaaaaaa 1860 
aaaaaaaaa 1869 

<210> 18 

<211> 441 

<212> PRT 

<213> Glycine max 

<400> 18 

Met Ser Gln Pro Lys Ile Lys Arg Arg Val Gly Lys Tyr Glu Val Gly
1 5 10 15

Arg Thr Ile Gly Glu Gly Thr Phe Ala Lys Val Lys Phe Ala Arg Asn
20 25 30

Ser Glu Thr Gly Glu Pro Val Ala Leu Lys Ile Leu Asp Lys Glu Lys
35 40 45

Val Leu Lys His Lys Met Ala Glu Gln Ile Arg Arg Glu Val Ala Thr
50 55 60

Met Lys Leu Ile Lys His Pro Asn Val Val Arg Leu Tyr Glu Val Met
65 70 75 80

Gly Ser Lys Thr Lys Ile Tyr Ile Val Leu Glu Phe Val Thr Gly Gly
85 90 95

Glu Leu Phe Asp Lys Ile Val Asn His Gly Arg Met Ser Glu Asn Glu
100 105 110

Ala Arg Arg Tyr Phe Gln Gln Leu Ile Asn Ala Val Asp Tyr Cys His
115 120 125

Ser Arg Gly Val Tyr His Arg Asp Leu Lys Pro Glu Asn Leu Leu Leu
130 135 140

Asp Thr Tyr Gly Asn Leu Lys Val Ser Asp Phe Gly Leu Ser Ala Leu
145 150 155 160

Ser Gln Gln Val Arg Asp Asp Gly Leu Leu His Thr Thr Cys Gly Thr
165 170 175

Pro Asn Tyr Val Ala Pro Glu Val Leu Asn Asp Arg Gly Tyr Asp Gly
180 185 190

Ala Thr Ala Asp Leu Trp Ser Cys Gly Val Ile Leu Phe Val Leu Val
195 200 205

Ala Gly Tyr Leu Pro Phe Asp Asp Pro Asn Leu Met Asn Leu Tyr Lys
210 215 220

Lys Ile Ser Ala Ala Glu Phe Thr Cys Pro Pro Trp Leu Ser Phe Thr
225 230 235 240

Ala Arg Lys Leu Ile Thr Arg Ile Leu Asp Pro Asp Pro Thr Thr Arg
245 250 255

Ile Thr Ile Pro Glu Ile Leu Asp Asp Glu Trp Phe Lys Lys Glu Tyr
260 265 270

Lys Pro Pro Ile Phe Glu Glu Asn Gly Glu Ile Asn Leu Asp Asp Val
275 280 285

Glu Ala Val Phe Lys Asp Ser Glu Glu His His Val Thr Glu Lys Lys
290 295 300

Glu Glu Gln Pro Thr Ala Met Asn Ala Phe Glu Leu Ile Ser Met Ser
305 310 315 320

Lys Gly Leu Asn Leu Glu Asn Leu Phe Asp Thr Glu Gln Gly Phe Lys
325 330 335

Arg Glu Thr Arg Phe Thr Ser Lys Ser Pro Ala Asp Glu Ile Ile Asn
340 345 350

Lys Ile Glu Glu Ala Ala Lys Pro Leu Gly Phe Asp Val Gln Lys Lys
355 360 365

Asn Tyr Lys Met Arg Leu Ala Asn Val Lys Ala Gly Arg Lys Gly Asn
370 375 380

Leu Asn Val Ala Thr Glu Ile Phe Gln Val Ala Pro Ser Leu His Met
385 390 395 400

Val Glu Val Arg Lys Ala Lys Gly Asp Thr Leu Glu Phe His Lys Phe
405 410 415

Tyr Lys Lys Leu Ser Thr Ser Leu Asp Asp Val Val Trp Lys Thr Glu
420 425 430

Asp Asp Met Gln Met Arg Glu Thr Lys
435 440

<210> 19 
<211> 817 
<212> DNA 

<213> Triticum aestivum 
<400> 19 

gcgctctcgt cgcctcccca cccattccag cggttgaccg cgggattcgg ccagtgaaaa 60 

tggaagggaa cactagagga ggtgggcatt ctgacgcatt aaagaactac aatgtgggca 120 

gaacattagg tataggcaca tttggaaaag tgaggattgc agagcataag catacagggc 180 

ataaagttgc tataaagatt ctgaaccgtc gtcaaatgag aactatggaa atggaggaga 240 

aagcaaagag agagatcaag atattgaggt tgttcatcca ccctcatatc atccggcttt 300 

atgaggtcat ttacacacct acagatatat ttgttgtgat ggaatattgc aagtatggtg 360 

agctattcga ctgcattgtt gagaaagggc ggttacagga agatgaggct cgtcgaatct 420 

tccagcagat tatatctggt gttgaatact gccacagaaa catggttgct catcgtgatc 480 

taaagccaga gaacctgtta cttgattcca aatacaatgt gaaacttgcc gactttgggt 540 

taagtaatgt catgcatgat ggccattttc tgaagactag ctgcgggagt ccaaactatg 600 

ctgcaccaga ggttatctca ggtaaattat acgctggacc tgaggttgat gtttggagct 660 

gcggggtgat actttatgct cttctttgtg gcactcttcc atttgatgat gacaatattc 720 

ccaaactgtt caaaaagata aagggaggca tctatatcct tccaagtcat ttatctgctc 780 

ctgcaaggga ttgatccaag aatgcttgtt gttgatc 817

<210> 20 

<211> 244 

<212> PRT 

<213> Triticum aestivum 

<400> 20 

Met Glu Gly Asn Thr Arg Gly Gly Gly His Ser Asp Ala Leu Lys Asn
1 5 10 15

Tyr Asn Val Gly Arg Thr Leu Gly Ile Gly Thr Phe Gly Lys Val Arg
20 25 30

Ile Ala Glu His Lys His Thr Gly His Lys Val Ala Ile Lys Ile Leu
35 40 45

Asn Arg Arg Gln Met Arg Thr Met Glu Met Glu Glu Lys Ala Lys Arg
50 55 60

Glu Ile Lys Ile Leu Arg Leu Phe Ile His Pro His Ile Ile Arg Leu
65 70 75 80

Tyr Glu Val Ile Tyr Thr Pro Thr Asp Ile Phe Val Val Met Glu Tyr
85 90 95

Cys Lys Tyr Gly Glu Leu Phe Asp Cys Ile Val Glu Lys Gly Arg Leu
100 105 110

Gln Glu Asp Glu Ala Arg Arg Ile Phe Gln Gln Ile Ile Ser Gly Val
115 120 125

Glu Tyr Cys His Arg Asn Met Val Ala His Arg Asp Leu Lys Pro Glu
130 135 140

Asn Leu Leu Leu Asp Ser Lys Tyr Asn Val Lys Leu Ala Asp Phe Gly
145 150 155 160

Leu Ser Asn Val Met His Asp Gly His Phe Leu Lys Thr Ser Cys Gly
165 170 175

Ser Pro Asn Tyr Ala Ala Pro Glu Val Ile Ser Gly Lys Leu Tyr Ala
180 185 190

Gly Pro Glu Val Asp Val Trp Ser Cys Gly Val Ile Leu Tyr Ala Leu
195 200 205

Leu Cys Gly Thr Leu Pro Phe Asp Asp Asp Asn Ile Pro Lys Leu Phe
210 215 220

Lys Lys Ile Lys Gly Gly Ile Tyr Ile Leu Pro Ser His Leu Ser Ala
225 230 235 240

Pro Ala Arg Asp



<210> 21 
<211> 2006 
<212> DNA 

<213> Triticum aestivum
<400> 21 

ctccgcgccg ccgctgccgc tacgcctctc cccgggaagc ctcgccggcg gccaggtgga 60 

agatggagac aggcggcaaa gatggcaacc ctttgaagaa ttaccgtatt gggaagaccc 120 

tggggattgg ttcgttcggg aaggtcaaga ttgccgagca tataaaaact ggtcacaagg 180

tggccgtcaa gatccttaac cgccggaaaa tcaaaaacat ggagatggaa gagaaagtga 240

aaagagagat caagatatta agattattca tgcacccaca tatcatccgc ctttatgaag 300 

tgatagaggc accagctgat atttatgtgg ttatggagta tgttaagtct ggtgaattgt 360 

ttgattacat tgttgagaaa ggraggctac aggaggaaga ggcccgccgt ttctttcaac 420 

agatcatatc tggtgttcaa tattgccaca ggaacatggt ggtgcaccgc gatctaaagc 480 

cggagaacct tcttttggac aataattgtg atgttaagat tgcggatttt ggcttaagta 540 

atgttatgcg tgacggccac tttcttaaga caagttgtgg tagcccaaat tatgcagctc 600 

cggaggttat atctggaaaa ctgtacgctg ggcctgaagt tgatgtatgg agctgcggtg 660 

ttattcttta tgctcttcta tgtggtactc ttccatttga tgatgagaac atacccaacc 720 

tttttaagaa aataaagggt ggaatatata cccttccaag ccatttatca ggcccagcaa 780 

gggatttgat tccaaggatg ctagttgttg atcctatgaa gaggataacc attcgtgaaa 840 

tacgcgagca tccatggttt gaagctcaac tcccacgata tttagccgtg cctccaccag 900 

atactgcaca acaagttaaa aagattgatg aagaatctct tgttaaagtt atcagtctgg 960 

gatttgacaa aaacctgctg gttgaatcaa ttcataatag attgcaaaat gaggcaacag 1020 

ttgcatatta tttgtttttg gataataaga gtcgcacaac aactggctat cttggagctg 1080 

ggtatcaaga agctatggaa tcgtctttct cacccattac tccaagtgaa acacaaagtc 1140 

cagctcatgg aaatcggcaa caaccatata tggaatctcc agttggcttg agaccacatt 1200 

ttccagctga taggaaatgg gctcttgggc ttcagtctcg agcacatcca agagaagtta 1260 

tgactgaagt gctgaaggct ctgcaagaac tgaatgtata ctggaaaaaa attggacact 1320 

ataacatgaa atgtagatgg agtcctcctg gctttcccgg tcaggagaat atgaatcata 1380 

ccaattataa cttcagtgca gagcctattg aaaccgacga cctgggtgac aagttaaatt 1440 

taattaagtt cgaacttcag ctttacaaaa caagagatga gaaatacctt ctggatttgc 1500 

aaagggcgag cgggccgcat ctcctctttc ttgatctatg tgccgccttt ctagctcagc 1560 

tgagagtctt ttgataccag atgtgcccga ggaatgtatg ttgtatcact ctaaagagat 1620 
gtaaatagca agctttctcc agcggarcaa agtcgtggag tatgtagaca tgcggagctg 1680

ttgtgtgctt atttcggcgc ctatatgctg aatttagacc tggcaggggc gggcaagtga 1740 

agcaagcaag gaactattgc catcaggtta tttccagctg ccgccaaagg cactaggata 1800 

tagaagtatt actgattaat ccratattgg ccccttggga catactccta ctctactgct 1860 

gtttacttgc atgtaatttt tactgtctgg gtctccagac cagaccacgt acacgaataa 1920 

tttcttcaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1980 

aaaaaaaaaa aaaaaaaaaa aaaaaa 2006 



<210> 22 

<211> 523 

<212> PRT 

<213> Triticum aestivum

<400> 22

Pro Arg Arg Arg Cys Arg Tyr Ala Ser Pro Arg Glu Ala Ser Pro Ala
1 5 10 15

Ala Arg Trp Lys Met Glu Thr Gly Gly Lys Asp Gly Asn Pro Leu Lys
20 25 30

Asn Tyr Arg Ile Gly Lys Thr Leu Gly Ile Gly Ser Phe Gly Lys Val
35 40 45

Lys Ile Ala Glu His Ile Lys Thr Gly His Lys Val Ala Val Lys Ile
50 55 60

Leu Asn Arg Arg Lys Ile Lys Asn Met Glu Met Glu Glu Lys Val Lys
65 70 75 80

Arg Glu Ile Lys Ile Leu Arg Leu Phe Met His Pro His Ile Ile Arg
85 90 95

Leu Tyr Glu Val Ile Glu Ala Pro Ala Asp Ile Tyr Val Val Met Glu
100 105 110

Tyr Val Lys Ser Gly Glu Leu Phe Asp Tyr Ile Val Glu Lys Gly Arg
115 120 125

Leu Gln Glu Glu Glu Ala Arg Arg Phe Phe Gln Gln Ile Ile Ser Gly
130 135 140

Val Gln Tyr Cys His Arg Asn Met Val Val His Arg Asp Leu Lys Pro
145 150 155 160

Glu Asn Leu Leu Leu Asp Asn Asn Cys Asp Val Lys Ile Ala Asp Phe
165 170 175

Gly Leu Ser Asn Val Met Arg Asp Gly His Phe Leu Lys Thr Ser Cys
180 185 190

Gly Ser Pro Asn Tyr Ala Ala Pro Glu Val Ile Ser Gly Lys Leu Tyr
195 200 205

Ala Gly Pro Glu Val Asp Val Trp Ser Cys Gly Val Ile Leu Tyr Ala
210 215 220

Leu Leu Cys Gly Thr Leu Pro Phe Asp Asp Glu Asn Ile Pro Asn Leu
225 230 235 240

Phe Lys Lys Ile Lys Gly Gly Ile Tyr Thr Leu Pro Ser His Leu Ser
245 250 255

Gly Pro Ala Arg Asp Leu Ile Pro Arg Met Leu Val Val Asp Pro Met
260 265 270

Lys Arg Ile Thr Ile Arg Glu Ile Arg Glu His Pro Trp Phe Glu Ala
275 280 285

Gln Leu Pro Arg Tyr Leu Ala Val Pro Pro Pro Asp Thr Ala Gln Gln
290 295 300

Val Lys Lys Ile Asp Glu Glu Ser Leu Val Lys Val Ile Ser Leu Gly
305 310 315 320

Phe Asp Lys Asn Leu Leu Val Glu Ser Ile His Asn Arg Leu Gln Asn
325 330 335

Glu Ala Thr Val Ala Tyr Tyr Leu Phe Leu Asp Asn Lys Ser Arg Thr
340 345 350

Thr Thr Gly Tyr Leu Gly Ala Gly Tyr Gln Glu Ala Met Glu Ser Ser
355 360 365

Phe Ser Pro Ile Thr Pro Ser Glu Thr Gln Ser Pro Ala His Gly Asn
370 375 380

Arg Gln Gln Pro Tyr Met Glu Ser Pro Val Gly Leu Arg Pro His Phe
385 390 395 400

Pro Ala Asp Arg Lys Trp Ala Leu Gly Leu Gln Ser Arg Ala His Pro
405 410 415

Arg Glu Val Met Thr Glu Val Leu Lys Ala Leu Gln Glu Leu Asn Val
420 425 430

Tyr Trp Lys Lys Ile Gly His Tyr Asn Met Lys Cys Arg Trp Ser Pro
435 440 445

Pro Gly Phe Pro Gly Gln Glu Asn Met Asn His Thr Asn Tyr Asn Phe
450 455 460

Ser Ala Glu Pro Ile Glu Thr Asp Asp Leu Gly Asp Lys Leu Asn Leu
465 470 475 480

Ile Lys Phe Glu Leu Gln Leu Tyr Lys Thr Arg Asp Glu Lys Tyr Leu
485 490 495

Leu Asp Leu Gln Arg Ala Ser Gly Pro His Leu Leu Phe Leu Asp Leu
500 505 510

Cys Ala Ala Phe Leu Ala Gln Leu Arg Val Phe
515 520

<210> 23
<211> 512
<212> DNA
<213> Zea mays

<400> 23

gagcagctcc cctgcccctc gcagcggcta ctctacaggt ctagcgactc tttcgccatc 60
catagaggga ggaggcgcgg cggagatggt gggcggtggc ggcggcgggc cgctgcggcg 120
ggtgggcaag tacgaggtgg gacgcaccat cggggaaggc accttcgcca aggtcaagtt 180
cgcgcagaac accgagaccg gggagagcgt cgccatgaag gtgctcgacc gctcctccat 240
cctcaagaac aagatggccg aacagattaa gagagaaata tccataatga agcttgtcag 300
gcatcccaat gtcgttaggc tacacgaggt tttggcaagc cggaagaaga tatttataat 360
tctggagttc atcactggcg gcgagctatt cgataaaatt attcgtcatg ggagactcag 420
tgaagcagat gcccgcagat actttcagca gcttattgat ggtgttgatt tttgtcacaa 480
gaaaggagtc taccatcgag acttaaagcc tg 512

<210> 24

<211> 132

<212> PRT

<213> Zea mays

<400> 24

Arg Arg Val Gly Lys Tyr Glu Val Gly Arg Thr Ile Gly Glu Gly Thr
1 5 10 15

Phe Ala Lys Val Lys Phe Ala Gln Asn Thr Glu Thr Gly Glu Ser Val
20 25 30

Ala Met Lys Val Leu Asp Arg Ser Ser Ile Leu Lys Asn Lys Met Ala
35 40 45

Glu Gln Ile Lys Arg Glu Ile Ser Ile Met Lys Leu Val Arg His Pro
50 55 60

Asn Val Val Arg Leu His Glu Val Leu Ala Ser Arg Lys Lys Ile Phe
65 70 75 80

Ile Ile Leu Glu Phe Ile Thr Gly Gly Glu Leu Phe Asp Lys Ile Ile
85 90 95

Arg His Gly Arg Leu Ser Glu Ala Asp Ala Arg Arg Tyr Phe Gln Gln
100 105 110

Leu Ile Asp Gly Val Asp Phe Cys His Lys Lys Gly Val Tyr His Arg
115 120 125

Asp Leu Lys Pro
130

<210> 25 

<211> 552

<212> DNA

<213> Glycine max

<220> 

<221> unsure 
<222> (385) 

<400> 25 

gtctggttgc atagcattgg ttggtagttg tctcaaaaat ctcttcttgc cctttggcca 60 

taatcaaaag ccaagacact gttcatacag ctgctcaatt atcaagccaa ccttgctcgg 120 

ttccactgca gaatttcagt ttattcttat ctagctcaat tctggttgtg ggtttatctc 180 

ttactggaag acagactttg aggtagactc cttataagtg cgcagaagtt caagtgtaga 240 

gaatgagtca gcctaagatt aaacgccgag ttggtaaata cgaggtgggg aggaccattg 300 

gtgaaggtac atttgcaaag gtgaaatttg caaggaactc tgagacagga gagccgtggc 360 
tcttaaaatt cttgacaagg agaangtgct aaagcacaag atggctgagc agatcaggag 420

agaagtagct acaatgaaac taatcaagca tccaaatgtt gttcgattgt atgaagtcat 480 

gggaagcaag acaaatatat aatgttttgg agttgtactg ggggggaacc cttgcaaatt 540 

gtaaccatgg aa 552 

<210> 26 

<211> 77 

<212> PRT 

<213> Glycine max 

<220> 

<221> UNSURE 
<222> (39) 

<400> 26 

Val Gly Lys Tyr Glu Val Gly Arg Thr Ile Gly Glu Gly Thr Phe Ala
1 5 10 15

Lys Val Lys Phe Ala Arg Asn Ser Glu Thr Gly Glu Pro Trp Leu Leu
20 25 30

Lys Phe Leu Thr Arg Arg Xaa Val Leu Lys His Lys Met Ala Glu Gln
35 40 45

Ile Arg Arg Glu
50 55 60

Val Arg Leu Tyr
65 70 75

<210> 27

<211> 391

<212> DNA

<213> Triticum

<220>

<221> unsure
<222> (179)

<220>

<221> unsure
<222> (236)

<220>

<221> unsure
<222> (240)

<220>

<221> unsure
<222> (297)

<220>

<221> unsure
<222> (316)

<220>

<221> unsure
<222> (331)

<220>

<221> unsure 
<222> (336) 



<220> 

<221> unsure 
<222> (338) 



<220> 

<221> unsure 
<222> (344) 



<220> 

<221> unsure 
<222> (357) 

<220> 

<221> unsure 
<222> (361) 

<220> 

<221> unsure 
<222> (371) 

<220> 

<221> unsure 
<222> (381) 

<220> 

<221> unsure 
<222> (386) 

<400> 27 

ctccgcgccg ccgctgccgc tacgcctctc cccgggaagc ctcgccggcg gccaggtgga 60 

agatggagac aggcggcaaa gatggcaacc ctttgaagaa ttaccgtatt gggaagaccc 120

tggggattgg ttcgttcggg aaggtcaaga ttgccgagca tataaaaact ggtcacaang 180 

tggccgtcaa gatccttaac cgccggcaaa tcaaaaacat ggcgatggaa gagaangtgn 240 

caagagagat caagatatta agattattca tgcacccaca tatcatccgc ctttatnaag 300 

tgatagaggc accagntgat atttatgtgg ntatgnanta tgtnaaagtc cggtganttg 360 

nttgattata ntgtttctaa ngtctntata t 391 

<210> 28 

<211> 85 

<212> PRT 

<213> Triticum aestivum
<220> 

<221> UNSURE 

<222> (29) 



<220> 

<221> UNSURE 

<222> (48) 

<220> 

<221> UNSURE 

<222> (50) 
<220>

<221> UNSURE 
<222> (69) 

<220> 

<221> UNSURE 
<222> (75) 

<220> 

<221> UNSURE 
<222> (80) 

<220> 

<221> UNSURE 
<222> (82) 

<400> 28

Leu Lys Asn Tyr Arg Ile Gly Lys Thr Leu Gly Ile Gly Ser Phe Gly
1 5 10 15

Lys Val Lys Ile Ala Glu His Ile Lys Thr Gly His Xaa Val Ala Val
20 25 30

Lys Ile Leu Asn Arg Arg Gln Ile Lys Asn Met Ala Met Glu Glu Xaa
35 40 45

Val Xaa Arg Glu Ile Lys Ile Leu Arg Leu Phe Met His Pro His Ile
50 55 60

Ile Arg Leu Tyr Xaa Val Ile Glu Ala Pro Xaa Asp Ile Tyr Val Xaa
65 70 75 80

Met Xaa Tyr Val Lys
85



